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5 (54) Title: METHODS FOR DIAGNOSING AND TREATING MULTIPLE SCLEROSIS AND COMPOSITIONS THEREOF 
O 

(57) Abstract: The present invention is directed to novel methods for diagnosis and prognosis of Multiple Sclerosis by identifying 
Q differentially expressed genes. Moreover, the present invention is also directed to methods that can be used to screen test compounds 

and therapies for the ability to inhibit multiple sclerosis. Additionally, methods and molecule targets (genes and their products) for 
^ therapeutic intervention in multiple sclerosis are described. 
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TITLE 

METHODS FOR DIAGNOSING AND TREATING 
MULTIPLE SCLEROSIS AND COMPOSITIONS THEREOF 

5 

[0001] This application claims benefit of U.S. 
Provisional Patent Application No. 60/280,572, filed 
March 30,2001. 

10 Field of the Invention 

[0002] The present invention is directed to novel 
methods for diagnosis and prognosis of Multiple 
Sclerosis by identifying differentially expressed 
15 genes. Additionally, methods and molecular targets 

(genes and their products) for therapeutic intervention 
in multiple sclerosis are described. 

Background of the Invention 

20 

[0003] Multiple Sclerosis (MS) is a chronic, often 
disabling autoimmune disease of the central nervous 
system that first appears in young adults, with greater 
occurrence in women than men. MS involves an 
25 inflammation of the central nervous system, in which 

lymphocytes attack myelin and oligodendrocytes to leave 
behind characteristic lesions or plaques in the brain 
and spinal cord. The involvement of complex 
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immunological pathways, in particular systemic T-cell 
mediated pathways, has presented a challenge to 
researchers attempting to elucidate the mechanism of 
the disease. 

5 

[0003] Population studies on MS have suggested that MS 
is a product of genetic susceptibility as compounded by 
environmental exposure of an unknown nature. Genetic 
susceptibility of the disease has been borne out by a 

10 correlated familial risk, showing the greatest 

recurrence among monozygotic twins, then among full- 
siblings and then cousins. Dyment et al . "Genetics of 
Multiple Sclerosis" Human Molecular Genetics, 6: 1693- 
1698 (1997). A connection to the environment has 

15 further been drawn by several-fold differences in risk 
between populations in different latitudes, with 
increasing recurrence in higher latitudes with cooler 
climates (i.e. northern Europe or Canada) . Dyment et 
al. Population studies however, have been unable to 

2 0 clearly identify all of the genes involved in MS and 
their complex interaction, nor have these studies been 
able to provide significant guidance on treatment or 
therapy for MS . 

25 [0004] The etiology of MS remains unclear, although 

numerous T-helper cell inflammatory pathways have been 
implicated in MS pathology. Due to the complex 
interaction of many genes expressed in these cell- 
mediated immune pathways, current methodologies have 

30 yielded limited success in elucidating the disease. 

Various methods such as differential display, in situ 
hybridization, reverse transcriptase-polymerase chain 
reaction or competitive polymerase chain reaction have 
been used to study the patterns of gene expression with 

35 limited success. Baranzini et al . "Transcriptional 
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Analysis of Multiple Sclerosis Brain Lesions Reveals a 
Complex Pattern of Cytokine Expression, " jr. Immunol. 
165: 6576-6582 (2000); van Boxel-Dezaire et al . 
"Decreased Interleukin- 10 and Increased Interleukin- 
5 12p40 mRNA are Associated with Disease Activity and 
Characterize Different Disease Stages in Multiple 
Sclerosis" Annals of Neurology, 45: 695-703 (1999)). A 
study by Whitney and colleagues utilized genetic 
expression data from one patient to compare MS tissue 

10 to non-MS tissue, revealing that about sixty- two genes 
were differentially expressed, including the Duffy 
chemokine receptor, interferon regulatory factor-2, and 
tumor necrosis factor alpha receptor-2 . Nonetheless, 
no method has yet been developed to provide 

15 comprehensive data on differentially expressed genes 
from multiple subjects at multiple stages of 
development, or through a wide range of geographical 
regions . 

20 [0005] Furthermore, the nature and variability of MS 
as expressed in different individuals has proven to be 
a challenge in characterizing the disease and in 
providing a prognosis for each patient. Three general 
stages or types of MS have been characterized: (1) 

25 secondary progressive, in which symptoms and disability 
gradually worsen over time as opposed to having 
discreet recognizable attacks or relapses; (2) primary 
progressive, in which there are no attacks but a 
gradual worsening from the start; and (3) relapsing - 

30 remitting, in which there is a clinical worsening of 
symptoms followed temporary improvement. The present 
invention therefore addresses these issues by using 
differentially expressed genes to provide methods for 
diagnosis and prognosis, and assays for therapeutic 

3 5 intervention. 
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SUMMARY OP THE INVENTION 

[000 6] In one embodiment, the invention provides a 
5 method of diagnosing a subject with multiple sclerosis 
by comparing the level of expression of a marker in a 
sample from a subject, where the marker is selected 
from the group of markers set forth in Tables 1-5 (for 
the animal model of multiple sclerosis) or 7-10, to the 
10 normal level of expression of the marker in a control 
sample, where a substantial difference between the 
level of expression of the marker in the sample from 
the subject and the normal level is an indication that 
the subject is afflicted with multiple sclerosis. In a 
15 preferred embodiment, the marker corresponds to a 
transcribed polynucleotide or a portion thereof. 
Preferably, the marker corresponds to a transcribed 
polynucleotide or a portion thereof, and the sample is 
collected from brain tissue or comprises peripheral 
20 blood mononuclear cells (PBMCs) . In, another preferred 
embodiment, the control sample is from non-involved 
tissue from the subject. Alternatively, the control 
sample is from the tissue of a nondiseased subject* In 
a further preferred embodiment, the level of expression 
2 5 of the marker in the sample differs from the normal 
level of expression of the marker in a subject not 
afflicted by a factor of at least two, and in an even 
more preferred embodiment, the expression levels differ 
by a factor of at least five. 
30 . . 

[0007 J In another preferred embodiment, the level of 
expression of the marker in the sample is assessed by 
detecting the presence in the sample of a protein 
corresponding to the marker. In a particularly 
35 preferred embodiment, the presence of the protein is 
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detected using a reagent which specifically binds with 
the protein. In an even more preferred embodiment, the 
reagent comprises an antibody or fragments thereof. In 
another preferred embodiment, the method comprises a 
5 marker selected from markers listed in Table 9 or 10 . 
In another preferred embodiment, the level of 
expression of the marker in the sample is assessed by 
detecting the presence in the sample of a transcribed 
polynucleotide or portion thereof, where the 
10 transcribed polynucleotide includes the marker. In a 
particularly preferred embodiment, the transcribed 
polynucleotide is an mRNA or a cDNA. 

[0008] In yet another preferred embodiment, the level 
15 of expression of the marker in the sample is assessed 
by detecting the presence in the sample of a transcribed 
polynucleotide or a portion thereof which hybridizes with a 
labeled probe under stringent conditions, wherein the 
transcribed polynucleotide comprises the marker 

20 

[0009] In another preferred embodiment for diagnosing 
a subject with multiple sclerosis, the level of 
expression in the sample of each of a panel of markers 
independently selected from the markers listed in 

25 Tables 1-5 or 7-10 is compared with the normal level of 
expression of the same panel of markers in a control 
sample, where the level of expression of more than one 
of the markers is substantially different, relative to 
the corresponding normal levels of expression of the 

3 0 markers, indicating that the subject is afflicted wi.th 
multiple sclerosis. In a particularly preferred 
embodiment, the plurality includes at least five of the 
markers set forth in Tables 1-5 or 7-10. 
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[0010] In another embodiment, the invention provides a 
method of monitoring the progression of multiple 
sclerosis in a subject, including detecting in a 
subject sample at a first point in time the expression 
5 of marker, where the marker is selected from the group 
including the markers listed in Tables 1-5 or 7-10, 
repeating this detection step at a subsequent point in 
time with the same marker, and detecting a substantial 
difference between the levels of expression, thus 

10 indicating that the subject has progressed to a 

different stage of multiple sclerosis. In a preferred 
embodiment, at least 5 markers are selected from the 
group of markers Tables 1-5 (murine) or 7-10 (human) 
and combinations thereof. In another preferred 

15 embodiment, the marker corresponds to a transcribed 
polynucleotide or portion thereof, where the 
. polynucleotide includes the marker. In a particularly 
preferred embodiment, the cells are collected from 
brain or blood tissue (PBMCs) . 

20 

[0011] In another embodiment, the invention provides a 
method of assessing the efficacy of a test compound for 
inhibiting multiple sclerosis in a subject, including 
comparing expression of a marker in a first sample 

25 obtained from the subject which is exposed to or 

maintained in the presence of the test compound, where 
the marker is selected from the group including the 
markers listed in Tables 1-5 (murine) or 7-10 (human) , 
to expression of the marker in a second sample obtained 

3 0 from the subject, "where the second sample is not 

exposed to the test compound, where a substantially 
different level of expression of the marker in the 
first sample relative to that in the second sample is 
an indication that the test compound is efficacious for 

35 inhibiting multiple sclerosis in the subject. In a 
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preferred embodiment, the first and second samples are 
portions of a single sample obtained from the subject. 
In a particularly preferred embodiment, the 
substantially different level of expression is a lower 
5 level of expression in the first sample. 

[0012] In another embodiment, the invention provides a 
method of assessing the efficacy of a therapy for 
inhibiting multiple sclerosis in a subject, the method 

10 including comparing expression of a marker in the first 
sample obtained from the subject prior to providing at 
least a portion of the therapy to the subject, where 
the marker is selected from the group including the 
markers listed in Tables 1-5 (murine) or 7-10 (human), 

15 to expression of the marker in a second sample obtained 
from the subject following provision of the portion of 
the therapy, where a substantially different level of 
expression of the marker in the second sample relative 
to the first sample, is an indication that the therapy 

20 is efficacious for inhibiting multiple sclerosis in the 
subject. In a preferred embodiment, the substantially 
different level of expression is a substantially lower 
level of expression in the second sample. In a 
particularly preferred embodiment, the method further 

25 comprises a step of comparing expression of the marker 
in a control sample, where a substantially similar 
level of expression in the second sample, relative to 
the control sample, is an additional indication that 
the test compound is efficacious for inhibiting 

30 multiple sclerosis. 

[0013] In another embodiment, the invention provides a 
method of screening test compounds for inhibitors of 
multiple sclerosis in a subject, the method including 
35 obtaining a sample including cells from a subject, 
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separately maintaining aliquots of the sample in the 
presence of a plurality of test compounds, comparing 
expression of a marker in each of the aliquots, where 
the marker is selected from the group including the 
5 markers listed in Tables 1-5 (murine) or 7-10 (human), 
and selecting one of the test compounds which induces a 
substantially different level of expression of the 
marker in the aliquot containing that test compound, 
relative to other test compounds. In a particularly 
10 preferred embodiment, the substantially different level 
of expression is a substantially lower level of 
expression. In an alternative preferred embodiment, 
the substantially different level of expression is a 
substantially enhanced level of expression. 

15 

[0014] In another embodiment, the invention provides a 
kit for diagnosing a subject with multiple sclerosis, 
including reagents for assessing expression of a marker 
selected from the group including the markers listed in 
20 Tables 1-5 (murine) or 7-10 (human) . 

[0015] In another embodiment, the invention provides a 
kit for diagnosing multiple sclerosis in a subject, the 
kit including a nucleic acid probe where the probe 
25 specifically binds with a transcribed polynucleotide 
corresponding to a marker selected from the group 
including the markers listed in Tables 1-5 (murine) or 
7-10 (human) . 

30 [0016] In another embodiment, the invention provides a 
kit for assessing the suitability of each of a 
plurality of compounds for inhibiting multiple 
sclerosis, the kit including a plurality of compounds 
and a reagent for assessing expression of a marker 
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selected from the group including the markers listed in 
Tables 1-5 (murine) or 7-10 (human) . 

[0017] In another embodiment, the invention provides a 
5 kit for diagnosing a subject with multiple sclerosis, 
the kit including an antibody which specifically binds 
with a protein corresponding to a marker selected from 
the group including the markers listed in Tables 1-5 

(murine) or 7-10 (human) . 

10 

[0018] In another embodiment, the invention provides a 
method of modulating the level of expression of a 
marker selected from the markers listed in Tables 1-5 
(murine) or 7-10 (human) , the method comprising 
15 providing to diseased cells of the subject an antisense 
oligonucleotide complementary to a polynucleotide . 
corresponding to the marker. 

[0019] in yet another embodiment, the invention 

2 0 provides a method of modulating the level of expression 

of a marker selected from the markers listed in Tables 
1-5 (murine) or 7-10 (human), the method comprising 
providing to diseased cells of a subject a protein. In 
a particularly preferred embodiment, the invention 
25 further provides a vector which comprises a 
polynucleotide encoding the protein. 

[0020] In another embodiment, the invention provides a 
method of modulating a level of expression of a marker 
30 selected from the markers listed in Tables 1-5 (murine) 
or 7-10 (human), where the method comprises providing 
to diseased cells of a subject an antibody. In a 
particularly preferred embodiment, the method further 
comprises a therapeutic moiety conjugated to the 

3 5 antibody. 
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[0021] In another preferred embodiment, the invention 
provides a method of localizing a therapeutic moiety to 
diseased tissue of a subject comprising exposing the 
tissue to an antibody which is specific to a protein 
5 encoded by a marker listed in Tables 1-5 (murine) or 7- 
10 (human) . 

[0022] In another preferred embodiment, the present 
invention provides a method of screening for a test 

10 compound capable of modulating the activity of ,a 
protein encoded from a marker listed in Tables 1-5 
(murine) or 7-10 (human) , said method comprising 
combining said protein and test compound, and 
determining the effect of said test compound on the 

15 therapeutic efficacy of said protein. 

[0023] In yet another preferred embodiment, the 
present invention provides a method of screening for a 
bioactive agent capable of interfering with the binding 
of a protein or a fragment thereof and an antibody 
which binds to said protein or fragment thereof, where 
the method combines a protein. or fragment thereof, a 
bioactive agent and an antibody which binds to the 
protein or fragment thereof, wherein the method further 
includes determining the binding of the protein or 
fragment thereof and the antibody. 

[0024] In another preferred embodiment, the present 
invention provides an antibody which specifically binds 
3 0 to a protein encoded from a marker listed in Tables 1-5 
(murine) or 7-10 (human) . In particularly preferred 
embodiment, the antibody is monoclonal and humanized. 
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[0026] In yet another preferred embodiment, the 
present invention provides a peptide encoded from 
markers listed in Tables 1-5 (murine) or 7-10 (human) . 
Furthermore, the present invention is also directed to 
5 a composition comprising the peptide. 

[0027] In an alternative embodiment, the present 
invention provides a composition capable of modulating 
an immune response in a subject, where the composition 
10 comprises a protein encoded from a marker listed in 
Tables 1-5 (murine) or 7-10 (human) and a 
pharmaceutically acceptable carrier. 

[0028] In yet another embodiment, the present 
15 invention provides a biochip comprising a panel of 
markers selected from the group of markers listed in 
Tables 1-5 (murine) or 7-10 (human) . Furthermore, in a 
particularly preferred embodiment, the markers for a 
biochip may be selected for subjects suspected of 
20 having multiple sclerosis from different stages of the 
disease: secondary progressive, primary progressive, 
relapsing-remitting. In a still another embodiment, 
the markers may be selected for a subject which is from 
a higher-risk geographical region. 

25 

[0029] Other features and advantages of the invention 
will be apparent from the following detailed 
description and claims. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

[0030] FIGURE 1A through ID depict the relative 
expression levels of four cytokines, IL-10 (FIGURE 1A; 
p<0.4), IL-8(FIGURE IB; p<0.027), IL-12p35 ( FIGURE 1C; 
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p<0.028) and IL-lp ( FIGURE ID; p<0.011), in MS-afflicted 
subjects versus nondiseased subjects, as measured in 
Example 2(D) below. The sample size was 10 MS-afflicted 
subjects and 10 nondiseased subjects. 

5 

[0031] FIGURE 2 depicts the results of the class 
predictor used in Example 2 (E) , wherein a diagnosis of 
"normal" is indicated as positive and "MS-af f licted" 
are indicated as negative. The model was based on 
10 markers listed in Table 10 , as expressed in human blood 
samples . 

DETAILED DESCRIPTION OF THE INVENTION 
[0032] The present invention provides methods for 
15 diagnosis and prognosis evaluation for multiple 

sclerosis (MS) in subjects, as well as methods and 
molecular targets for therapeutic intervention. 

[0033] In one aspect of the invention, the expression 

20 levels of genes are determined in a particular patient 
sample for which either diagnosis or prognosis 
information is desired. The level of expression of a 
number of genes simultaneously provides an expression 
profile, which is essentially a "fingerprint" of the 

25 activity of a gene or plurality of genes that is unique 
to the state of the cell. Comparison of relative 
levels of expression have been found to be indicative 
of the presence of multiple sclerosis, and as such 
permits for diagnostic and prognostic analysis. By 

30 comparing relative expression profiles of multiple 
sclerosis tissue in known different states (i.e. 
secondary progressive vs. primary progressive vs. 
relapsing-remitting) , information regarding which genes 
are important (including both up- and down-regulation 

35 of genes) in each of these states is obtained. The 
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identification of gene markers that are differentially 
expressed in diseased versus non-diseased tissue, as 
well as differential expression resulting in different 
prognostic outcomes, allows the use of this invention 
5 in a number of ways. For example, the evaluation of a 
particular treatment regime may be evaluated: will a 
particular drug act to improve the long-term prognosis 
in a particular patient? The discovery of these 
differential expression patterns for individual genes 

10 allows for screening of drug candidates with an eye to 
mimicking or altering a particular expression pattern; 
for example, screening can be done for drugs that will 
alter the MS differential expression pattern or convert 
a poor prognosis pattern to a better prognosis pattern. 

15 This may be done by making biochips comprising sets of 
the significant MS genes, which can then be used in 
these screens. These methods can also be done on the 
protein basis; that is protein expression levels of the 
MS-associated proteins can be evaluated for diagnostic 

20 and prognostic purposes or to screen test compounds. 
In addition, the markers can be administered for gene 
therapy purposes, including the administration of 
antisense nucleic acids, or proteins (including 
antibodies and other modulators thereof) administered 

25 as therapeutic drugs. 

[0034] Moreover, in a preferred embodiment, the 
relative expression levels are measured from human 
samples; however, as will be appreciated by those in 

3 0 the art, expression levels of markers from other 

organisms may be useful in animal models of disease and 
drug evaluation; thus other markers are provided, from 
vertebrates, including mammals, including rodents 
{mice, hamsters, rats, guinea pigs, etc) , primates, 

3 5 farm animals (including sheep, goats, pigs, cows, 
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horses, etc) . Markers from other organisms may be 
obtained using the techniques outlined below. 

[0035] The present invention is based f at least in 
5 part, on the identification of a number of genetic 
markers, set forth in Tables 1-5 (mouse) and 7-10 
(human) , which are differentially expressed between 
diseased samples (MS-associated) and non-diseased 
samples. Using a murine model of experimental 

10 autoimmune encephalitis ("EAE") as an analogy to human 
MS, a panel of 11,000 known murine genes was screened 
for expression in diseased versus non-diseased tissue 
from twelve different mice afflicted with the disease 
(see Example 1) . Those genes with statistically 

15 substantial differences between the diseased and normal 
tissues are identified in Tables 1-2 . This 
differential expression was observed either as an 
increase in expression (Table 1) , or a decrease in 
expression (Table 2) . In addition, to narrow the 

20 subset of diseased-related, immune -mediated genes, 

diseased cells were stimulated in vitro with a growth 
protein and .with an inhibitory compound, to yield 6 
genes which were differentially expressed: CAPN12 
(calpain 12), MT1 (metallothioneln 1), MYOIF (myosin 

25 11) , TLN (talin) , UNK_AA117532 (EST) , and UNK_AA645990 
(EST) . 

T003 6I Using the murine model as a springboard, a 
panel of 12,000 human genes was likewise screened for 

3 0 differential expression between diseased samples of the 
brain and blood versus non-diseased samples (either 
from noninvolved brain tissue or from non-diseased 
subjects) (see Example 2) . Genes that were 
differentially up- or down- regulated in MS blood 

35 (peripheral blood mononuclear cells, or "PBMCs") 
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samples are shown in Table 7, while genes that were 
differentially up- or down -regulated in MS brain 
samples are shown in Table 8. Furthermore, comparison 
of the differentially expressed genes between blood and 
5 brain samples yielded 181 genes in common. Further 
comparison of the 181 genes to the murine genes that 
were differentially regulated in vitro yielded six 
genes (shown in Table 9 with their accession numbers) ; 
EEF1D, PIM2, PRDX2, SEC24C, UNKJVJ24 AND XIP. 

10 

[0037] Included among the genes used to screen diseased 
versus non-diseased tissue in the murine panel were two 
genes known in the art to be implicated in EAE: TNF-a 
and IFN-p. These genes served as an internal control. 
15 Each of these genes were found to be substantially 

increased in expression in EAE cells as opposed to non- 
diseased cells, thus validating the method as a means 
for identifying significant genes involved in EAE 
pathology. Correspondingly, the genes which are known 

2 0 in the art to be linked to MS (listed in Table 6) may 

also serve as validation in expression studies for MS. 
Moreover, the differentially regulated genes of the 
invention, as listed in Tables 1-5 (murine) or 7-10 
(human) , have not been previously associated with EAE 
25 or multiple sclerosis. 

[0038] Accordingly, the present invention pertains to 
the use of the genes set forth in Tables 1-5 (murine) 
or 7-10 (human), the corresponding mRNA transcripts, 

3 0 and the encoded polypeptides as markers for the 

presence or risk of development of MS. These markers 
are further useful to correlate the extent and/or 
severity of disease. In particular, the present 
invention is directed to the genes set forth in Table 9 
35 (genes which were shown to be differentially regulated 
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in both murine and human models of MS) , and in Table 
10, the genes which were used in the MS class predictor 
of Example 2 (E) . Panels of the markers can be 
conveniently arrayed on solid supports, i.e. biochips 
5 for use in kits. Markers can also be useful for 

assessing the efficacy of a treatment or therapy of MS. 

[0039] In one aspect, the invention provides markers 
whose level of expression, which signifies their 

10 quantity or activity, is correlated with the presence 
of MS. The markers of the invention may be nucleic 
acid molecules (e.g., DNA, cDNA or mRNA) or peptide(s). 
Preferably the invention is performed by detecting the 
presence of a transcribed polynucleotide or a portion 

15 thereof, wherein the transcribed polynucleotide 

comprises the marker. Alternatively, detection may be 
performed be detecting the presence of a protein which 
corresponds to the marker. The markers of the 
invention are either increased or decreased in quantity 

20 or activity in MS tissue as compared to non-diseased 
tissue. For example, the gene designated ^SAAS' is 
increased in expression level in murine EAE cells,- 
relative to control cells, while the gene designated 
% E_TC3 6651_s" is decreased in expression level in 

25 murine EAE cells, relative to control cells. Both the 
presence of increased or decreased mRNA for these genes 
(and for other genes set forth in Tables 1-5 and 7-10), 
and also increased or decreased levels of the protein 
products of these genes (and other genes set forth in 

30 Tables 1-5 and 7-10) serve as markers for either EAE or 
MS. Preferably, increased or decreased levels of the 
markers of the invention are increases and decreases of 
a magnitude that are statistically substantial as 
compared to appropriate control samples (i.e., non- 
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involved tissue or from non-diseased subjects.) In 
particularly preferred embodiments, the marker is 
increased or decreased relative to control samples by 
at least 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, or 10- fold or 
5 more. Similarly one skilled in the art will be 
cognizant of the fact that a preferred detection 
methodology is one in which the resulting detection 
values are above the minimum detection limit of the 
methodology. 

10 

[0040] Detection and measurement of the relative 
amount of a nucleic acid or peptide marker of the 
invention may be by any method known in the art (see, 
i.e., Sambrook, J., Fritsh, E..F., and Mania t is, T. 

15 Molecular Cloning: A Laboratory Manual. 2 nd , ed., Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY (1989), and Current 
Protocols in Molecular Biology, eds. Ausubel et al, 
John Wiley & Sons (1992)). Typical methodologies for 

20 detection of a transcribed polynucleotide include RITA 
extraction firom a cell or tissue sample, followed by 
hybridization of a labeled probe (i.e., a complementary 
nucleic acid molecule) specific for the target RNA to 
the extracted RNA and detection of the probe (i.e. 

25 Northern blotting) . Typical methodologies for peptide 
detection include protein extraction from a cell or 
tissue sample, followed by hybridization of a labeled 
probe (i.e., an antibody) specific for the target 
protein to the protein sample, and detection of the 

3 0 probe. The label group can be a radioisotope, a 
fluorescent compound, an enzyme, or an enzyme co- 
factor. Detection of specific peptide (s) and nucleic 
acid molecules may also be assessed by gel 
electrophoresis, column chromatography, direct 

35 sequencing, or quantitative PCR (in the case of nucleic 
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acid molecules) among many other techniques well known 
to those skilled in the art, 

[0 041] In certain embodiments, the genes themselves 
5 (i.e., the DNA or cDNA) may serve as markers for MS. 

For example, the absence of nucleic acids corresponding 
to a gene (i.e. a gene from Table 2) such as by 
deletion of all or part of the gene, may be correlated 
with disease. Similarly an increase of nucleic acid 
10 corresponding to a gene (i.e. a gene from Tables 1-5 

and 7-10) , such as by duplication of the gene, may also 
be correlated with disease. 

[0042] Detection of the presence or number of copies 
15 of all or a part of a marker gene of the invention may 
be performed using any method known in the art. 
Typically, it is convenient to assess the presence 
and/or quantity of a DNA or cDNA by Southern analysis, 
in which total DNA from a cell or tissue sample is 
20 extracted, is hybridized with a labeled probe (i.e. a 
complementary DNA molecules) , and the probe is 
detected. The label group can be a radioisotope, a 
fluorescent compound, an enzyme, or an enzyme co- 
factor. Other useful methods of DNA detection and/or 
25 quantification include direct sequencing, gel 
electrophoresis, column chromatography, and 
quantitative PCR, as is known by one skilled in the 
art . 

30 [0043] The invention also encompasses nucleic acid and 
peptide molecules which are structurally different from 
the molecules described above (i.e. which have a slight 
altered nucleic acid or amino acid sequence) , but which 
have the same properties as the molecules above (e.g., 

35 encoded amino acid sequences, or which are changed only 
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in nonessential amino acid residues) . Such molecules 
include allelic variants, and are described in greater 
detail in subsection I. 

5 [0044] In another aspect, the invention provides 

markers whose quantity or activity is correlated with 
different stages of MS: secondary progressive, primary- 
progressive, relapsing-remitting . These markers are 
either increased or decreased in quantity or activity 

10 in MS tissue in a fashion that is either positively or 
negatively correlated with the degree of severity of 
the MS. A method of monitoring progression of MS in 
subjects may be devised by detecting a substantial 
difference between the levels of expression in a 

15 diseased subject at different points in time. The 

subsequent level of expression may further be compared 
to different expression profiles of various MS stages 
to confirm whether the subject has a matching profile. 
In yet another aspect, the invention provides markers 

20 whose quantity or activity is correlated with a risk in 
a subject for developing MS. For instance, the markers 
may be selected for higher-risk geographical regions. 
These markers are either increased or decreased in 
activity or quantity in direct correlation to the 

25 likelihood of the development of MS in a subject. 

[0045] Each marker may be considered individually, 
although it is within the scope of the invention to 
provide combinations of two or more markers for use in 

30 the methods and compositions of the invention to 

increase the confidence of the analysis. In another 
aspect, the invention provides panels of the markers of 
the invention. In a preferred embodiment, these panels 
of markers are selected such that the markers within 

3 5 any one panel share certain features (see Example 
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2(E)). For example, the markers of a first panel may- 
each exhibit a decrease in quantity or activity in MS * 
tissue as compared to samples from non- involved samples 
from the same subject or tissue from a non-diseased 
5 subject. Similarly, different panels of markers may be 
composed of markers from different tissues (i.e., blood 
(Table 7) or brain tissue (Table 8)), or may represent 
different components of an MS stage or type (i.e., 
secondary progressive, primary progressive, relapsing - 

10 remitting in MS humans ; or onset, peak and recovery in 
EAE mice) . Panels of the markers of the invention may 
be made by independently selecting markers from any of 
Tables 1-5 for mice, and Tables 7-10 for humans, and 
may further be provided on biochips, as discussed 

15 below. 

[0046] It will be appreciated by one skilled in the art 
that the panels of markers of the invention may 
conveniently be provided on solid supports, as a 
biochip. For example, polynucleotides may be coupled 
to an array (e.g., a biochip using GeneChip® for 
hybridization analysis), to a resin (e.g., a resin 
which can be packed into a column for column 
chromatography), or a matrix (e.g. a nitrocellulose 
matrix for northern blot analysis) . The immobilization 
of molecules complementary to the marker (s), either 
covalently or noncovalently, permits a discrete 
analysis of the presence or activity of each marker in 
a sample. In an array, for example, polynucleotides 
complementary to each member of a panel of markers may 
individually be attached to different, known locations 
on the array. The array may be hybridized with, for 
example, polynucleotides extracted from a brain sample 
from a subject. The hybridization of polynucleotides 
from the sample with the array at any location on the 
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array can be detected, and thus the presence or 
quantity of the marker in the sample can be 
ascertained. In a preferred embodiment, an array based 
on a biochip is employed. Similarly, Western analyses 
5 may be performed on immobilized antibodies specific for 
different polypeptide markers hybridized to a protein 
sample from a subject. 

[0047] It will also be apparent to one skilled in the 
10 art that the entire marker protein or nucleic acid 

molecule need not be conjugated to the biochip support; 

a portion of the marker or sufficient length for 

detection purposes (i:e., for hybridization), for 

example a portion of the marker which is 7, 10, 15, 20, 
15 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 100 or more 

nucleotides or amino acids in length may be sufficient 

for detection purposes. 

[0048] The nucleic acid and peptide markers of the 
20 invention may be isolated from any tissue or cell of a 
subject. In a preferred embodiment, the tissue is 
brain tissue. However, it will be apparent to one 
skilled in the art that other tissue samples, including 
bodily fluids such as blood, may also serve as sources 

2 5 from which the markers of the invention may be 

assessed. The tissue samples containing one or more of 
the markers themselves may be useful in the methods of 
the invention, and one skilled in the art will be 
cognizant of the methods by which such samples may be 

3 0 conveniently obtained, stored and/or preserved. 



[0049] Several markers were known prior to the 
invention to be associated with MS and are provided in 
Table 6. These markers are not to be considered as 
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markers of the invention. However, these markers may- 
be conveniently be used in combination with the markers 
of the invention (Tables 1-5 and 7-10) in the methods, 
panels and kits of the invention. 

5 

[0050] In another aspect, the invention provides 
methods of making an isolated hybridoma which produces 
an antibody useful for diagnosing a patient with MS. 
In this method, a protein corresponding to a marker of 

10 the invention is isolated (e.g., by purification from a 
cell in which it is expressed or by transcription and 
translation of a nucleic acid encoding the protein in 
vivo or in vitro using known methods) . A vertebrate, 
preferably a mammal such as a mouse, rabbit or sheep, 

15 is immunized using the isolated protein or protein 
fragment. The vertebrate may optionally (and 
preferably) be immunized at least one additional time 
with the isolated protein or protein fragment, so that 
the vertebrate exhibits a robust immune response to the 

20 protein or protein fragment. Splenocytes are isolated 
from the immunized vertebrate and fused with an 
immortalized cell line to form hybridomas, using any of 
a variety of methods well known in the art. Hybridomas 
formed in this manner are then screened using standard 

25 methods to identify one or more hybridomas which 

produce an antibody which specifically binds with the 
protein or protein fragment. The invention also 
includes hybridomas made by this method and antibodies 
made using such hybridomas. 

30 

[00 51] The invention provides methods of diagnosing MS, 
or determining the risk of developing MS. These 
methods involve isolating a sample from a subject 
(e.g., a sample containing blood cells or brain cells), 
3 5 detecting the presence, quantity and/or activity of one 
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or more markers of the invention in the sample relative 
to a second sample from a non-diseased subject, or from 
a non-involved tissue in the same subject. The levels 
of markers in the two samples are compared, and a 
5 substantial increase or decrease in one or more markers 
in the test sample indicates the presence or risk of 
presence of MS in the subject. 

[0052] The invention also provides methods of 

10 assessing the efficacy of a test compound or therapy 
for inhibiting MS in a subject. These methods involve 
isolating samples from a subject suffering from MS who 
is undergoing treatment or therapy, and detecting the 
presence, quantity, and/or activity of one or more 

15 markers of the invention in the first sample relative 
to a second sample. Where a test compound is 
administered, the first and second samples are 
preferably sub-portions of a single sample taken from 
the patient, wherein the first portion is exposed to 

20 the test compound and the second portion is not. In 
one aspect of this embodiment, the substantially 
different level of expression is a substantially lower 
level of expression in the first sample, relative to 
the second. Most preferably, the level of expression 

25 in the first sample approximates (i.e., less than a two 
fold difference from a control) the level of expression 
in a third control sample, taken from either a non- 
diseased subject or non- involved tissue. 

3 0 [0053] Where the efficacy of a therapy is being 

assessed, the first sample obtained from the subject is 
preferably obtained prior to provision of at least a 
portion of the therapy, whereas the second sample is 
obtained following provision of the portion of the 

35 therapy. The levels of markers in the samples are 
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compared, preferably against a third control sample as 
well, and correlated with the presence, risk of 
presence, or severity of MS. Most preferably, the 
level of markers in the second sample approximates the 
5 level of expression of a third control sample. By 

assessing whether expression of MS has been lessened or 
alleviated in the sample, the ability of the treatment 
or therapy to treat MS is determined. 

10 [00 54] The invention also provides a method of 

screening test compounds for inhibitors of MS, and to 
the pharmaceutical compositions comprising the test 
compounds. The method of screening comprises obtaining 
samples of diseased or involved cells, maintaining 

15 separate aliquots of the samples with a plurality of 

test compounds, and comparing expression of a marker in 
each of the aliquots to determine whether any of the 
test compounds provides a substantially different level 
of expression from a control. In addition, methods of 

20 screening may be devised by combining a test compound 
with a protein and thereby determining the effect of 
the test compound on the protein. Alternatively, the 
invention is further directed to a method of screening 
for bioactive agents capable of interfering with the 

25 binding of a protein encoded by the markers of Tables 
1-5 (murine) or 7-10 (human) , and an antibody, by 
combining the bioactive agent, protein, and antibody 
together and determining whether binding of the 
antibody and protein occurs. 

30 

[0055] Moreover, the invention is directed to 
pharmaceutical compositions comprising the test 
compound, or bioactive agent, which may further include 
a marker protein and/or nucleic acid of the invention 
35 (e.g., for those markers in Tables 1-5 or 7-10 which 
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are decreased or increased in quantity or activity in 
MS versus non-diseased tissue) , and can be formulated 
as described herein. Alternatively, these compositions 
may include an antibody which specifically binds to a 
5 marker protein of the invention and/or an antisense 
nucleic acid molecule which is complementary to a 
marker nucleic acid of the invention (e.g., for those 
markers which are increased in quantity in MS tissue) 
and can be formulated as described herein. 

10 

[0056] The invention further provides methods of 
modulating a level of expression of a marker of the 
invention, comprising administration to the diseased 
cells of the subject a variety of compositions which 
15 correspond to the markers of Tables 1-5 (murine) or 7- 
10 (human) , including proteins or antisense 
oligonucleotides. The protein may be provided to the 
diseased cells by further providing a vector comprising 
a polynucleotide encoding the protein to the cells. 

2 0 Alternatively, the expression levels of the markers of 

the invention may be modulated by providing an 
antibody, a plurality of antibodies or an antibody 
conjugated to a therapeutic moiety. Treatment with the 
antibody may further be localized to the diseased 
25 tissue. In another aspect, the invention provides 

methods for localizing a therapeutic moiety to diseased 
tissue comprising exposing the tissue to an antibody 
which is specific to a protein encoded from the markers 
of the invention'. This method may therefore provide a 

3 0 means to inhibit or enhance expression of a specific 

gene corresponding to a marker listed in Tables 1-5 or 
7-10. Where the gene is up-regulated as a result of MS 
pathology, it is likely that inhibition of MS 
progression would involve inhibiting expression of the 
35 up-regulated gene. As a corollary to this method, 
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where the gene is down-regulated, inhibition of MS 
progression would therefore likely require enhancing 
expression of the down-regulated gene. 

5 [0057] In another aspect, the invention includes 

antibodies that are specific to proteins corresponding 
to markers of the invention. Preferably the antibodies 
are monoclonal, and most preferably, the antibodies are 
humanized, as per the description of antibodies 
10 described below. 

[0058] In still another aspect of the invention, the 
invention includes peptides or proteins which are 
encoded from the markers of the invention, and to 
15 compositions thereof. 

[0059] The invention also provides kits for diagnosing 
a subject with MS, the kit comprising reagents for 
assessing expression of the markers of the invention. 

2 0 Preferably, the reagents may be an antibody or fragment 
thereof, wherein the antibody or fragment thereof 
specifically binds with a protein corresponding to a 
marker from Tables 1-5 or 7-10. Optionally, the kits 
may comprise a nucleic acid probe wherein the probe 

25 specifically binds with a transcribed polynucleotide 
corresponding to a marker selected from the group 
consisting of the markers listed in Tables 1-5 or 7-10. 

30 [0060] The invention- further provides kits for 

assessing the suitability of each of a plurality of 
compounds for inhibiting progression of MS in a 
subject. Such kits include a plurality of compounds to 
be tested, and a reagent (i.e. antibody specific to 
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corresponding proteins of the invention) for assessing 
expression of a marker listed in Tables 1-5 or 7-10. 

[0061] Modifications to the above-described 
5 compositions and methods of the invent ion , according to 
standard techniques, will be readily apparent to one 
skilled in the art and are meant to be encompassed by 
the invention, 

10 10062] To facilitate an understanding of the present 
invention, a number of terms and phrases are defined 
below: 

[0063] As used herein, the term "modulation" includes, 
15 in its various grammatical forms (e.g., "modulated" , 
"modulation" , "modulating" , etc . ) , up-regulation, 
induction, stimulation, potentiation, and/or relief of 
inhibition, as well as inhibition and/or down- 
regulation. 

20 

[0064] As used herein, the terms "polynucleotide" and 
"oligonucleotide" are used interchangeably, and include 
polymeric forms of nucleotides of any length, either 
deoxyribonucleotides or ribonucleotides, or analogs 

25 thereof. Polynucleotides may have any three- 
dimensional structure, and may perform any function, 
known or unknown. The following are non-limiting 
examples of polynucleotides: a gene or gene fragment, 
exons, introns, messenger RNA (mRNA) , transfer RNA, 

3 0 ribosomal RNA, ribozymes, cDNA, recombinant 

polynucleotides, branched polynucleotides, plasmids, 
vectors, isolated DNA of any sequence, isolated RNA of 
any sequence, nucleic acid probes, and primers. A 
polynucleotide may comprise modified nucleotides, such 
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as methylated nucleotides and nucleotide analogs. If 
present, modifications to the nucleotide structure may 
be imparted before or after assembly of the polymer. 
The sequence of nucleotides may be interrupted by non- 
5 nucleotide components. A polynucleotide may be further 
modified after polymerization, such as by conjugation 
with a labeling component. The term also includes both 
double- and single-stranded molecules. Unless 
otherwise specified or required, any embodiment of this 
10 invention that is a polynucleotide encompasses both the 
double -stranded form and each of two complementary 
single-stranded forms known or predicted to make up the 
double- stranded form. 

15 [0065] A polynucleotide is composed of a specific 
sequence of four nucleotide bases: adenine (A) ; 
cytosine(C); guanine (G) ; thymine (T) ; and uracil (U) 
for guanine when the polynucleotide is RNA. This, the 
term "polynucleotide sequence" is the alphabetical 

20 representation of a polynucleotide molecule. This 
alphabetical representation can be inputted into 
databases in a computer having a central processing 
unit and used for bioinf ormatics applications such as 
functional genomics and homology searching. 

25 

[0066] A "gene" includes a polynucleotide containing 
at least one open reading frame that is capable of 
encoding a particular polypeptide or protein after 
being transcribed and translated. Any of the 

30 polynucleotide sequences described herein may be used 
to identify larger fragments or full-length coding 
sequences of the gene with which they are associated. 
Methods of isolating larger fragment sequences are 
known to those of sill in the art, some of which are 

35 described herein. 
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100 67] A "gene product" includes an amino acid 
sequence [e.g. , peptide or polypeptide) generated when a 
gene is transcribed and translated. 

5 

[0068] As used herein, a "polynucleotide corresponds 
to" another (a first) polynucleotide if it is related 
to the first polynucleotide by any of the following 
relationships : 

10 

1) The second polynucleotide comprises the first 
polynucleotide and the second polynucleotide encodes a 
gene product . 

15 . 2) The second polynucleotide is 5' or 3 1 to the first 
polynucleotide in cDNA, RNA, genomic DNA, or fragments 
of any of these polynucleotides. For example, a second 
polynucleotide may be a fragment of a gene that 
includes the first and second polynucleotides. The 

2 0 first and second polynucleotides are related in that 
they are components of the gene coding for a gene 
product, such as a protein or antibody. However, it is 
not necessary that the second polynucleotide comprises 
or overlaps with the first polynucleotide to be 

2 5 encompassed within the definition of "corresponding to" 

as used herein. For example, the first polynucleotide 
may be a fragment of a 3 9 untranslated region of the 
second polynucleotide. The first and second 
polynucleotide may be fragments of a gene coding for a 

3 0 gene product. The second polynucleotide may be an exon 

of the gene while the first polynucleotide may be an 
intron of the gene . 
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3) The second polynucleotide is the complement of the 
first polynucleotide. 

[0069] As used herein, the term, "transcribed" or 
5 "transcription" refers to the process by which genetic 
code information is transferred from one kind of 
nucleic acid to another, and refers in particular to 
the process by which a base sequence of mRNA is 
synthesized on a template of cDNA. 

10 

[0070] A "probe" when used in the context of 
polynucleotide manipulation includes an oligonucleotide 
that is provided as a reagent to detect a target 
present in a sample of interest by hybridizing with the 

15 target . Usually, a probe will comprise a label or a 
means by which a label can be attached, either before 
or subsequent to the hybridization reaction. Suitable 
labels include, but are not limited to radioisotopes, 
f luorochromes, chemiluminescent compounds, dyes, and 

2 0 proteins, including enzymes. 

[0071] A "primer" includes a short polynucleotide, 
generally with a free 3 1 -OH group that binds to a 
target or "template" present in a sample of interest by 

25 hybridizing with the target, and thereafter promoting 
polymerization of a polynucleotide complementary to the 
target. A "polymerase chain reaction" ("PCR") is a 
reaction in which replicate copies are made of a target 
polynucleotide using a "pair of primers" or "set or 

30 primers" consisting of "upstream" and a "downstream" 

primer, and a catalyst of polymerization, such as a DNA 
polymerase, and typically a thermally-stable polymerase 
enzyme. Methods for PCR are well known in the art, and 
are taught, for example, in MacPherson et al . , IRL 

35 Press at Oxford University Press (1991) ) . All 
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processes of producing replicate copies of a 
polynucleotide, such as PCR or gene cloning, are 
collectively referred to herein as "replication" . 
A primer can also be used as a probe in hybridization 
5 reactions, such as Southern or Northern blot analyses 
(see, e.g., Sambrook, J., Fritsh, E.F., and 
Maniatis, T. Molecular Cloning: A Laboratory Manual. 
2nd, ed.. Cold Spring Harbor Laboratory, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989) . 

10 

[0072] The term "cDNAs" includes complementary DNA, 
that is mRNA molecules present in a cell or organism 
made into cDNA with an enzyme such as reverse 
transcriptase. A * cDNA library" includes a collection 

15 of mRNA molecules present in a cell or organism, 

converted into cDNA molecules with the enzyme reverse 
transcriptase, then inserted into "vectors" (other DNA 
molecules that can continue to replicate after addition 
of foreign DNA) . Exemplary vectors for libraries 

2 0 include bacteriophage, viruses that infect bacteria 

{e.g.., lambda phage) . The library can then be probed 
for the specific cDNA (and thus mRNA) of interest. 

[0073] A "gene delivery vehicle" includes a molecule 

2 5 that is capable of inserting one or more 

polynucleotides into a host cell. Examples of gene 
delivery vehicles are liposomes, biocompatible 
polymers, including natural polymers and synthetic 
polymers ; lipoproteins ; polypeptides ; polysaccharides ; 

30 lipopolysaccharides; artificial viral envelopes; metal 
particles; and bacteria, viruses and viral vectors, 
such as baculovirus, adenovirus, and retrovirus, 
bacteriophage, cosmid, plasmid, fungal vector and other 
recombination vehicles typically used in the art which 
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have been described for replication and/or expression 
in a variety of eukaryotic and prokaryotic hosts. The 
gene delivery vehicles may be used for replication of 
the inserted polynucleotide, gene therapy as well as 
5 for simply polypeptide and protein expression. 

[0074] A Vector" includes a self -replicating nucleic 
acid molecule that transfers an inserted polynucleotide 
into and/or between host cells. The term is intended 

10 to include vectors that function primarily for 

insertion of a nucleic acid molecule into a cell, 
replication vectors that function primarily for the 
replication of nucleic acid and expression vectors that 
function for transcription and/or translation of the 

15 DNA or RNA. Also intended are vectors that provide 
more than one of the above function. 

[0075] A "host cell" is intended to include any 
individual cell or cell culture which can be or has 

2 0 been a recipient for vectors or for the incorporation 

of exogenous. nucleic acid molecules, polynucleotides 
and/or proteins. It also is intended to include 
progeny of a single cell . The progeny may not 
necessarily be completely identical {in morphology or 
25 in genomic or total DNA complement) to the original 
parent cell due to natural, accidental, or deliberate 
mutation. The cells may be prokaryotic or eukaryotic, 
and include but are not limited to bacterial cells, 
yeast cells, insect cells, animal cells, and mammalian 

3 0 cells, e.cj., murine, rat, simian or human cells. 

[0076] The term "genetically modified" includes a cell 
containing and/or expressing a foreign gene or nucleic 
acid sequence which in turn modifies the genotype or 
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phenotype of the cell or its progeny. This term 
includes any addition, deletion, or disruption to a 
cell's endogenous nucleotides. 

5 [0077] As used herein, "expression" includes the 

process by which polynucleotides are transcribed into 
mRNA and translated into peptides, polypeptides, or 
proteins. If the polynucleotide is derived from 
genomic DNA, expression may include splicing of the 

10 mRNA, if an appropriate eukaryotic host is selected. 
Regulatory elements required for expression include 
promoter sequences to bind RNA polymerase and 
transcription initiation sequences for ribosome 
binding. For example, a bacterial expression vector 

15 includes a promoter such as the lac promoter and for 
transcription initiation the Shine -Dalgarno sequence 
and the start codon AUG (Sambrook, J., Fritsh, E. F . , 
and Maniatis, T. Molecular Cloning: A Laboratory 
Manual. 2nd, ed., Cold Spring Harbor Laboratory, Cold 

20 Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989) . Similarly, a eukaryotic expression vector 
includes a heterologous or homologous promoter for RNA 
polymerase II, a downstream polyadenylation signal, the 
start codon AUG, and a termination codon for detachment 

25 of the ribosome. Such vectors can be obtained 

commercially or assembled by the sequences described in 
methods well known in the art, for example, the methods 
described below for constructing vectors in general. 

30 [0078] "Differentially expressed" , as applied to a 
gene, includes the differential production of mRNA 
transcribed from a gene or a protein product encoded by 
the gene. A differentially expressed gene may be 
overexpressed or underexpressed as compared to the 

35 expression level of a normal or control cell. In one 
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aspect, it includes a differential that is 2 times, 
preferably 5 times or preferably 10 times higher or 
lower than the expression level detected in a control 
sample. The term "differentially expressed" also 
5 includes nucleotide sequences in a cell or tissue which 
are expressed where silent in a control cell or not 
expressed where expressed in a control cell. 

[0079] The term "polypeptide" includes a compound of 
10 two or more subunit amino acids, amino acid analogs, or 
peptidomimetics . The subunits may be linked by peptide 
bonds . In another embodiment , the subunit may be 
linked by other bonds, e.g., ester, ether, etc. As 
used herein the term "amino acid" includes either 
15 natural and/or unnatural or synthetic amino acids, 

including glycine and both the D or L optical isomers, 
and amino acid analogs and peptidomimetics . A peptide 
of three or more amino acids is commonly referred to as 
an oligopeptide. Peptide chains of greater than three 
2 0 or more amino acids are referred to as a polypeptide or 
a protein. 

[0080] "Hybridization" includes a reaction in which 
one or more polynucleotides react to form a complex 

2 5 that is stabilized via hydrogen bonding between the 

bases of the nucleotide residues. The hydrogen bonding 
may occur by Watson-Crick base pairing, Hoogstein 
binding, or in any other sequence-specific manner. The 
complex may comprise two strands forming a duplex 

30 structure, there or more strands forming a multi- 
stranded complex, a single self -hybridizing strand, or 
any combination of these. A hybridization reaction may 
constitute a step in a more extensive process, such as 
the initiation of a PCR reaction, or the enzymatic 

35 cleavage of a polynucleotide by a ribozyme. 
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[0081] Hybridization reactions can be performed under 
conditions of different "stringency" . The stringency 
of a hybridization reaction includes the difficulty 
5 with which any two nucleic acid molecules will 

hybridize to one another. The present invention also 
includes polynucleotides capable of hybridizing under 
reduced stringency conditions, more preferably 
stringent conditions, and most preferably highly 

10 stringent conditions, to polynucleotides described 

herein. Examples of stringency conditions are shown in 
Table A below: highly stringent conditions are those 
that are at least as stringent as, for example, 
conditions A-F; stringent conditions are at least as 

15 stringent as, for example, conditions G-L; and reduced 
stringency conditions are at least as stringent as, for 
example, conditions M-R. 
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Table A. Stringency Conditions 



Stringency 
Condition 


Poly- 
nucleotide 
Hybrid 


Hybrid 
Lenoth (bp) 1 


Hybridization Temperature 
and Buffer 8 


Wash 

Temperature and 
Buffer 14 


A 


DNA:DNA 


>50 


65°C; IxSSC -or- 
42°C; IxSSC, 50% 
forma mide 


65°C; 0.3xSSC 


B 


DNA:DNA 


<50 


T«*; 1xSSC 


T B *; 1xSSC 


C 


DNARNA 


>50 


67°C; 1xSSC -or- 
45°C; IxSSC, 50% 
formamide 


67°C; 0.3XSSC 


D 


DNA:RNA 


<50 


T n *; 1xSSC 


T n *; 1xSSC 


E 


RNARNA 


>50 


70°C; 1xSSC -or- 
50°C; IxSSC, 50% 
formamide 


70°C; 0.3xSSC 


F 


RNA.RNA 


<50 


T F *; 1xSSC 


T,*; 1xSSC 


G 


DNADNA 


>50 


65°C; 4xSSC -or- 
42°C; 4xSSC, 50% 
formamide 


65°C; IxSSC 


H 


DNA:DNA 


<50 


T H *; 4xSSC 


T H *; 4xSSC 


1 


DNArRNA 


>50 


67°C;4xSSC -or- 
45"C; 4xSSC, 50% 
formamide 


67°C; IxSSC 


J 


DNA:RNA 


<50 


T,*; 4xSSC 


T/; 4xSSC 


K 


RNARNA 


>50 


70°C; 4xSSC -or- 
50°C; 4xSSC, 50% 
formamide 


67°C; IxSSC 


L 


RNA:RNA 


<50 


T,*;2xSSC ] 


T t *; 2xSSC 


M 


DNA:DNA 


>50 


50 °C; 4xSSC -or- 
40°C; 6xSSC, 50% 
formamide 


50°C; 2xSSC 


N 


DNArDNA 


<50 


T M *; 6xSSC 


T N *; 6xSSC | 


O 


DNA:RNA 


>50 


55°C; 4xSSC -or- 
42°C; 6xSSC, 50% 
formamide 


55°C; 2xSSC 


P 


DNA:RNA 


<50 


T p *; 6xSSC 


T P *; 6xSSC 


Q 


RNAiRNA 


>50 


60 °C; 4xSSC -or- 
45°C; 6xSSC, 50% 
formamide 


60°C; 2xSSC 


R 


RNARNA 


<50 


T R *; 4xSSC 


T R *; 4xSSC 



1: The hybrid length is that anticipated for the hybridized region(s) of the hybridizing 
25 polynucleotides. When hybridizing a polynucleotide to a target polynucleotide of unknown 
sequence, the hybrid length is assumed to be that of the hybridizing polynucleotide. When 
polynucleotides of known sequence are hybridized, the hybrid length can be determined by aligning 
the sequences of the polynucleotides and identifying the region or regions of optimal sequence 
complementarity. 

30 

H : SSPE {IxSSPE is 0.1 5M NaCI, 10mM NaH 2 PO<, and 1.25mM EDTA, pH 7.4) can be substituted 
for SSC (IxSSC is 0.15M NaCI and 15mM sodium citrate) in the hybridization and wash buffers; 
washes are performed for 15 minutes after hybridization is complete. 

3 5 T B * - T R *: The hybridization temperature for hybrids anticipated to be less than 50 base pairs in 

length should be 5-1 0°C less than the melting temperature (TJ of the hybrid, where T m is 
determined according to the following equations. For hybrids less than 18 base pairs in length, 
T m (°C) = 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 18 and 49 base pairs in 
length, T m (°C) a 81.5 + 16.6(Iog 10 Na + ) + 0.41(%G+C) - (600/N), where N is the number of bases in 

4 0 the hybrid, and Na + is the concentration of sodium ions in the hybridization buffer (Na + for IxSSC = 

0.165 M). 
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Additional examples of stringency conditions for 
polynucleotide hybridization are provided in Sambrook, 
J., E.F. Fritsch, and T. Maniatis, 1989, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor 
5 Laboratory Press, Cold Spring Harbor, NY, chapters 9 
and 11, and Current Protocols in Molecular Biology , 
1995, F.M. Ausubel et al . , eds . , John Wiley & Sons, 
Inc., sections 2.10 and 6.3-6.4, incorporated herein by 
reference. 

10 

[0082] When hybridization occurs in an antiparallel 
configuration between two single-stranded 
polynucleotides, the reaction is called "annealing" and 
those polynucleotides are described as "complementary" . 

15 A double -stranded polynucleotide can be "complementary" 
or "homologous" to another polynucleotide, if 
hybridization can occur between one of the strands of 
the first polynucleotide and the second. 
"Complementarity" or "homology" (the degree that one 

20 polynucleotide is complementary with another) is 

quantifiable in terms of the proportion of bases in 
opposing strands that are expected to hydrogen bond 
with each other, according to generally accepted base- 
pairing rules . 

25 

[0083] An "antibody" includes an immunoglobulin 
molecule capable of binding an epitope present on an 
antigen. As used herein, the term encompasses not only 
intact immunoglobulin molecules such as monoclonal and 

30 polyclonal antibodies, but also anti-idotypic 

antibodies, mutants, fragments, fusion proteins, 
bi-specific antibodies, humanized proteins, and 
modifications of the immunoglobulin molecule that 
comprises an antigen recognition site of the required 

35 specificity. 
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[0084] As used herein, the term "diseased" refers to 
cells, tissues or samples from a subject afflicted with 
multiple sclerosis, wherein the cell, tissue or sample 
5 has been affected by multiple sclerosis (i.e. from 
white-matter lesion) . As used herein, the term "non- 
diseased" refers to cells, tissues or other such 
samples taken from a subject who is not afflicted with 
multiple sclerosis. As used herein, "non- involved" 

10 refers to cells, tissues, or samples wherein the tissue 
is from a subjected afflicted with MS, but wherein the 
cells, tissues or samples are believed to be unaffected 
by multiple sclerosis. Preferred tissue (and cell) 
samples are from brain, blood, sera, lymph, thymus, 

15 spleen, bone marrow or pus. Most preferred samples are 
peripheral blood mononuclear cells ( n PBMC" ) or brain 
tissue . 

[0085J As used herein, the term "marker" includes a 
polynucleotide or polypeptide molecule which is present 

2 0 or absent, or increased or decreased in quantity or 

activity in subjects afflicted with multiple sclerosis, 
or in MS-associated cells. The relative change in 
quantity or activity of the marker is correlated with 
the incidence or risk of incidence of multiple 

25 sclerosis. 

[0086] As used herein, the term "panel of markers" 
includes a group of markers, the quantity or activity 
of each member of which is correlated with the 

30 incidence or risk of incidence of a MS-associated 

condition. In certain embodiments, a panel of markers 
may include only those markers which are either 
increased or decreased in quantity or activity in 
subjects afflicted with or cells involved in a MS- 

35 associated condition. In a preferred embodiment, the 
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panel of markers comprises at least 5 markers, and most 
preferably, the panel comprises markers listed in Table 
10. In other embodiments, a panel of markers may 
include only those markers present in a specific tissue 
5 type which are correlated with the incidence of risk of 
incidence of a MS -associated condition. 

Various aspects of the invention are described iii 
further detail in the following subsections: 

10 

I. Isolated Nucleic Acid Molecules 

[0087] One aspect of the invention pertains to 
isolated nucleic acid molecules that either themselves 
15 are the genetic markers (e.g., mRNA) of the invention, 
or which encode the polypeptide markers of the 
invention, or fragments thereof. Another aspect of the 
invention pertains to isolated nucleic acid fragments 
sufficient for sue as hybridization probes to identify 
2 0 the nucleic acid molecules encoding the markers for the 
invention in a sample, as well as nucleotide fragments 
for use as PCR primers of the amplification or mutation 
of the nucleic acid molecules which encode the markers 
of the invention. As used herein, the term "nucleic 
25 acid molecule" is intended to include DNA molecules 
(e.g., cDNA or genomic DNA) and RNA molecules (e.g., 
mRNA) and analogs of the DNA or RNA generated using 
nucleotide analogs. The nucleic acid molecule can be 
single- stranded or double- stranded, but preferably is 
•30 double- stranded DNA.. 

[0088J The term "isolated nucleic acid molecule" 
includes nucleic acid molecules which are separated 
from other nucleic acid molecules which are present in 
35 the natural source of the nucleic acid. For example, 
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with regards to genomic DNA, the term "isolated" 
includes nucleic acid molecules which are separated 
from the chromosome with which the genomic DNA is 
naturally associated. Preferably, an "isolated" 
5 nucleic acid is free of sequences which naturally flank 
the nucleic acid (i.e., sequences located at the 5 1 and 
3 1 ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For 
example, in various embodiments, the isolated marker 

10 nucleic acid molecule of the invention, or nucleic acid 
molecule encoding a polypeptide marker of the 
invention, can contain less than about 5 kb, 4 kb, 3 
kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide 
sequences which naturally flank the nucleic acid 

15 molecule in genomic DNA of the cell from which the 
nucleic acid is derived. Moreover, an "isolated" 
nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material, or 
culture medium when produced by recombinant techniques, 

2 0 or substantially free of chemical precursors or other 
chemicals when chemically synthesized. 

[0089] A nucleic acid molecule of the present 
invention, e.g., a nucleic acid molecule having the 

25 nucleotide sequence of one of the genes set forth in 
Tables 1-5 or 7-10, or a portion thereof, can be 
isolated using standard molecular biology techniques 
and the sequence information provided herein. Using 
all or portion of the nucleic acid sequence of one of 

30 the genes set forth in Tables 1-5 or 7-10 as a 

hybridization probe, a marker gene of the invention or 
a nucleic acid molecule encoding a polypeptide marker 
of the invention can be isolated using standard 
hybridization and cloning techniques (e.g., as 

35 described in Sambrook, J., Fritsh, E. F . , and Maniatis, 
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T. Molecular Cloning: A Laboratory Manual 2nd, ed., 
Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold spring Harbor, NY, 1989) . 

5 [0090] A nucleic acid of the invention can be 

amplified using cDNA, mRNA or alternatively, genomic 
DNA, as a template and appropriate oligonucleotide 
primers according to standard PCR amplification 
techniques. The nucleic acid so amplified can be 

10 cloned into an appropriate vector and characterized by 
DNA sequence analysis. Furthermore, oligonucleotides 
corresponding to marker nucleotide sequences, or 
nucleotide sequences encoding a marker of the invention 
can be prepared by standard synthetic techniques, e.g., 

15 using an automated DNA synthesizer. 

[0091] In another preferred embodiment, an isolated 
nucleic acid molecule of the invention comprises a 
nucleic acid molecule which is a complement of the 

20 nucleotide sequence of a marker of the invention {e.g., 
a gene set forth in Tables 1-5 or 7-10) , or a portion 
of any of these nucleotide sequences. A nucleic acid 
molecule which is complementary to such a nucleotide 
sequence is one which is sufficiently complementary t 

25 the nucleotide sequence such that it can hybridize to 
the nucleotide sequence, thereby forming a stable 
duplex. 

[0092] The nucleic acid molecule of the invention, 
30 moreover, 'can comprise only a portion of the nucleic 
acid sequence of a marker nucleic acid of the 
invention, or a gene encoding a marker polypeptide of 
the invention, for example, a fragment which can be 
used as a probe or primer. The probe/primer typically 
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comprises substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of 
nucleotide sequence that hybridizes under stringent 
conditions to at least about 7 or 15, preferably about 
5 20 or 25, more preferably about 50 , 75, 100, 125, 150, 
175, 200, 225, 250, 275, 300, 325, 350, 400 or more 
consecutive nucleotides of a marker nucleic acid, or a 
nucleic acid encoding a marker polypeptide of the 
invention. 

10 

[0093 J Probes based on the nucleotide sequence of a 
marker gene or of a nucleic acid molecule encoding a 
marker polypeptide of the invention can be used to 
detect transcripts or genomic sequences corresponding 

15 to the marker gene(s) and/or marker polypeptide (s) of 
the invention. In preferred embodiments, the probe 
comprises a label group attached thereto, e.g., the 
label group can be a radioisotope, a fluorescent 
compound, an enzyme, or an enzyme co-factor- Such 

20 probes can be used as a part of a diagnostic test kit 
for identifying cells or tissue which misexpress {e.g., 
over- or under- express) a marker polypeptide of the 
invention, or which have greater or fewer copies of a 
marker gene of the invention. For example, a level of 

25 a marker polypeptide-encoding nucleic acid in a sample 
of cells from a subject may be detected, the amount of 
mRNA transcript of a gene encoding a marker polypeptide 
may be determined, or the presence of mutations or 
deletions of a marker gene of the invention may be 

30 assessed. 

[00 94] The invention further encompasses nucleic acid 
molecules that differ from the nucleic acid sequences 
of the genes set forth in Tables 1-5 or 7-10, due to 
35 degeneracy of the genetic code and which thus encode 
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the same proteins as those encoded by the genes shown 
in Tables 1-5 or 7-10. 

[0095] In addition to the nucleotide sequences of the 
5 genes set forth in Tables 1-5 or 7-10, it will be 
appreciated by those skilled in the art that DNA 
sequence polymorphisms that lead to changes in the 
amino acid sequences of the proteins encoded by the 
genes set forth in Tables 1-5 or 7-10 may exist within 

10 a population e.g., the human population). Such genetic 
polymorphism in the genes set forth in Tables 1-5 or 7- 
10 may exist among individuals within a population due 
to natural allelic variation. An allele is one of a 
group of genes which occur alternatively at a given 

15 genetic locus. In addition it will be appreciated that 
DNA polymorphisms that affect RNA expression levels can 
also exist that may affect the overall expression level 
of that gene e.g., by affecting regulation or 
degradation) . As used herein, the phrase "allelic 

2 0 variant" includes a nucleotide sequence which occurs ta 

a given locus or to a polypeptide encoded by the 
nucleotide sequence. As used herein, the terms "gene" 
and "recombinant gene" refer to nucleic acid molecules 
which include an open reading frame encoding a marker 
25 polypeptide of the invention. 

[0096] Nucleic acid molecules corresponding to natural 
allelic variants and homologues of the marker genes, or 
genes encoding the marker proteins of the invention can 

3 0 be isolated based on their homology to the genes set 

forth in Tables 7-10, using the cDNAs disclosed herein, 
or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under 
stringent hybridization conditions. Nucleic acid 
35 molecules corresponding to natural allelic variants and 
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homologues of the marker genes of the invention can 
further be isolated by mapping to the same chromosome 
or locus as the marker genes or genes encoding the 
marker proteins of the invention. 

5 

[0097] In another embodiment, an isolated nucleic acid 
molecule of the invention is at least 15, 20 , 25, 30, 
SO, 100, ISO, 200, 250, 300, 350, 400, 450, 500, 550, 
600, 650, 700, 750, 800, 850, 900, 950, 1000, 1100, 

10 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000 or 
more nucleotides in length and hybridizes under 
stringent conditions to a nucleic acid molecule 
corresponding to a nucleotide sequence of a marker gene 
or gene encoding a marker protein of the invention. As 

15 used herein, the term "hybridizes under stringent 
conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide 
sequences at least 60% homologous to each other 
typically remain hybridized to each other. Preferably, 

20 the conditions are such that sequences at least about 
70%, more preferably at least about 80%, even more 
preferably at least about 85% or 90% homologous to each 
other typically remain hybridized to each other. Such 
stringent conditions are known to those skilled in the 

25 art and can be found in Current Protocols in Molecular 
Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. 
Preferably, an isolated nucleic acid molecule of the 
invention that hybridizes under stringent conditions to 
the sequence of one of the genes set forth in Tables 1- 

30 5 or 7-10 corresponds to a naturally-occurring nucleic 
acid molecule. As used herein, a "naturally-occurring" 
nucleic acid molecule includes an RNA or DNA molecule 
having a nucleotide sequence that occurs in nature 
(e.g*., encodes a natural protein). 



35 
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[0098] In addition to naturally-occurring allelic 
variants of the marker gene and gene encoding a marker 
protein of the invention sequences that may exist in 
the population, the skilled artisan will further 
5 appreciate that changes can be introduced by mutation 
into the nucleotide sequences of the marker genes or 
genes encoding the marker proteins of the invention, 
thereby leading to changes in the amino acid sequence 
of the encoded proteins, without altering the 

10 functional activity of these proteins. For example, 
nucleotide substitutions leading to amino acid 
substitutions at "non-essential" amino acid residues 
can be made. A "non-essential" amino acid residue is a 
residue that can be altered from the wild-type sequence 

15 of a protein without altering the biological activity, 
whereas an "essential" amino acid residue is required 
for biological activity. For example, amino acid 
residues that are conserved among allelic variants or 
homologs of a gene (e.g. , among homologs of a gene from 

20 different species) are predicted to be particularly 
unamenable to alteration. 

[0099] Accordingly, another aspect of the invention 
pertains to nucleic acid molecules encoding a marker 

25 protein of the invention that contain changes in amino 
acid residues that are not essential for activity. 
Such proteins differ in amino acid sequence from the 
marker proteins encoded by the genes set forth in 
Tables 1-5 or 7-10, yet retain biological activity. In 

30 one embodiment, the protein comprises an amino acid 
sequence at least about 60%, 65%, 70%, 75%, 80%, 85%, 
90%, 95%, 98% or more homologous to a marker protein of 
the invention. 
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[0100] An isolated nucleic acid molecule encoding a 
protein homologous to a marker protein of the invention 
can be created by introducing one or more nucleotide 
substitutions, additions or deletions into the 
5 nucleotide sequence of the gene encoding the marker 
protein, such that one or more amino acid 
substitutions, additions or deletions are introduced 
into the encoded protein. Mutations can be introduced 
into the genes of the invention (e.g., a gene set forth 
10 in Tables 7-10) by standard techniques, such as site- 
directed mutagenesis and PCR-mediated mutagenesis. 
Preferably, conservative amino acid substitutions are 
made at one or more predicted non-essential amino acid 
residues. A "conservative amino acid substitution" is 
15 one in which the amino acid residue is replaced with an 
amino acid residue having a similar side chain. 
Families of amino acid residues having similar side 
chains have been defined in the art. These families 
include amino acids with basic side chains {e.g., 
20 lysine, arginine, histidine) , acidic side chains (e^., 
aspartic acid, glutamic acid) , uncharged polar side 
chains (e.g., glycine, asparagine, glutamine, serine, 
threonine, tyrosine, cysteine), nonpolar side chains 
(e.g., alanine, valine, leucine, isoleucine, proline, 
25 phenylalanine, methionine, tryptophan), beta-branched 
side chains (e.g., threonine, valine, isoleucine) and 
aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine) . Alternatively, mutations can 
be .introduced randomly along all or part of a coding 
3 0 sequence of a gene of the invention, such as by 

saturation mutagenesis, and the resultant mutants can 
be screened for biological activity to identify mutants 
that retain activity. Following mutagenesis, the 
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encoded protein can be expressed recombinantly and the 
activity of the protein can be determined. 

[0101] Another aspect of the invention pertains to 
5 isolated nucleic acid molecules which are antisense to 
the marker genes and genes encoding marker proteins of 
the invention. An "antisense" nucleic acid comprises a 
nucleotide sequence which is complementary to a "sense" 
nucleic acid encoding a protein, e.g., complementary to 

10 the coding strand of a double -stranded cDNA molecule or 
complementary to an mRNA sequence. Accordingly, an 
antisense nucleic acid can hydrogen bond to a sense 
nucleic acid. The antisense nucleic acid can be 
complementary to an entire coding strand of a gene of 

15 the invention (e.g., a gene set forth in Tables 1-5 or 
7-10), or to only a portion thereof. In one 
embodiment, an antisense nucleic acid molecule is 
antisense to a "coding region" of the coding strand of 
a nucleotide sequence of the invention. The term 

20 "coding region" includes the region of the nucleotide 
sequence comprising codons which are translated into 
amino acid. In another embodiment, the antisense 
nucleic acid molecule is antisense to a "noncoding 
region" of the coding strand of a nucleotide sequence 

25 of the invention. 

[0102] The term "noncoding region" includes 5 1 and 3» 
sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 
30 5 f and 3 1 untranslated regions). 

[0103] Antisense nucleic acids of the invention can be 
designed according to the rules of Watson and Crick 
base pairing. The antisense nucleic acid molecule can 
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be complementary to the entire coding region of an mRNA 
corresponding to a gene of the invention, but more 
preferably is an oligonucleotide which is antisense to 
only a portion of the coding or noncoding region. An 
5 antisense oligonucleotide can be, for example, about 5, 
10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in 
length. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 

10 For example, an antisense nucleic acid (e.g., an 
antisense oligonucleotide) can be chemically 
synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the 
biological stability of the molecules or to increase 

15 the physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g'., 
phosphorothioatc derivatives and acridine. substituted 
nucleotides can be used. Examples of modified 
nucleotides which can be used to generate the antisense 

20 nucleic acid include 5-f luorouracil, 5-bromouracil, 5- 
chlorouracil,. 5 -iodouracil , hypoxanthine , xantine, 4- 
acetylcytosine, 5- (carboxyhydroxyhnethyl) uracil, 5- 
carboxymethylaminomethyl-2-thiouridine, 

5 -carboxymethylaminomethyluracil , dihydrouracil , beta- 
25 D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2, 2-dimethylguanine, . 

2 - methyladenine , 2 -me thylguanine , 3 -me thylcytosine , 5 - 
methyl cytosine, N6-adenine, 7 -me thylguanine, 5- 
methylaminomethyluracil, 5 -methoxyaminome thyl -2 - 

30 thiouracil, beta-D-mannosylqueosine , 5'- 

methoxycarboxymethyluracil , 5-methoxyuracil , 2 - 
methylthio-N6 - isopentenyladen4exine , unacil - 5 -oxyacet ic 
acid (v) , wybutoxosine, pseudouracil , queosine, 2- 
thiocytosine, 5 -methyl -2 -thiouracil , 2 -thiouracil , 4- 

35 thiouracil, 5-methyluracil , uracil-5- oxyacetic acid 
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methylester, uracil-5-oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- (3 -amino-3-N-2-carboxypropyl) uracil, 
(acp3)w, and .2 , 6-diaminopurine . Alternatively, the 
antisense nucleic acid can be produced biologically 
5 using an expression vector into which a nucleic acid 
has been subcloned in an antisense orientation (i.e., 
RNA transcribed from the inserted nucleic acid will be 
of an antisense orientation to a target nucleic acid of 
interest, described further in the following 
10 subsection) . 

[0104] The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 
generated in situ such that they hybridize with or bind 

15 to cellular mRNA and/or genomic DNA encoding a marker . 
protein of the invention to thereby inhibit expression 
of the protein, e.g. , by inhibiting transcription 
and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a 

20 stable duplex, or, for example, in the cases of an 
antisense nucleic acid molecule which binds to DNA 
duplexes, through specific interactions in the major 
groove of the double helix. An* example of a route of 
administration of antisense nucleic acid molecules of 

25 the invention include direct injection at a tissue site 
(e.g., in brain). Alternatively, antisense nucleic 
acid molecules can be modified to target selected cells 
and then administered systemically . For example, for 
systemic administration, antisense molecules can be 

30 modified such that they specifically bind to receptors 
or antigens expressed on a selected cell surface, e.g., 
by linking the antisense nucleic acid molecules to 
peptides or antibodies which bind to cell surface 
receptors or antigens. The antisense nucleic acid 

35 molecules can also be delivered to cells using the 
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vectors described herein. To achieve sufficient 
intracellular concentrations of the antisense 
molecules, vector constructs in which the antisense 
nucleic acid molecule is placed under the control of a 
5 strong pol II or pol III promoter are preferred. 

[0105] In yet another embodiment, the antisense 
nucleic acid molecule of the invention is an a-anomeric 
nucleic acid molecule. An a-anomeric nucleic acid 
10 molecule forms specific double- stranded hybrids with 
complementary RNA in which, contrary to the usual 0- 
units, the strands run parallel to each other (Gaultier 
et al. (1987) Nucleic Acids. Res. 15:6625-6641). The 
antisense nucleic acid molecule can also comprise a 2 ' - 
15 o-methylribonucleotide (Inoue et al. (1987) Nucleic 

Acids Res. 15:6131-614 8) or a chimeric RNA-DNA analogue 

(Inoue et al . (1987) FEBS Lett. 215:327-330) . 

[0106] In still another embodiment, an antisense 
20 nucleic acid of the invention is a ribozyme . Ribozymes 
are catalytic RNA molecules with ribonuclease activity 
which are capable of cleaving a single -stranded nucleic 
acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., 
2 5 hammerhead ribozymes (described in Haselhoif and 
Gerlach (1988) Nature 334:585-591)) can be used to 
catalytically cleave mRNA transcripts of the genes of 
the invention (e.g., a gene set forth in Tables 1-5 or 
7-10) to thereby inhibit translation of this mRNA. A 
30 ribozyme having specificity for a marker protein- 
encoding nucleic acid can be designed based upon the 
nucleotide sequence of a gene of the invention, 
disclosed herein. For example, a derivative of a 
Tetrahymena L-19 IVS RNA can be constructed in which 
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the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved 
in a marker protein- encoding mRNA. See, e.g., Cech et 
al. U.S. Patent No. 4,987,071; and Cech et al. U.S.. 
5 Patent No. 5,116,742. Alternatively, mRNA transcribed 
from a gene of the invention can be used to select a 
catalytic RNA having a specific ribonuclease activity 
from a pool of RNA molecules. See, e.g., Bartel, D. 
and Szostak, J.W. (1993) Science 261:1411-1418. 

10 

[0107] Alternatively, expression of a gene of the 
invention (e.g., a gene set forth in Tables 1-5 or 7- 
10) can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region of these genes 

15 (e.g., the promoter and/or enhancers) to form triple 
helical structures that prevent transcription of the 
gene in target cells. See generally, Helene, C. (1991) 
Anticancer Drug Des. 6(6) :569-84; Helene, C. et al. 
(1992). Ann. N. Y. Acad Sci. 660:27-36; and Maher, L.J. 

20 (1992) Bioassays 14 (12) : 807-15. 

[0108] Expression of the marker genes, and genes 
encoding marker proteins of the invention, can also be 
inhibited using RNA interference ("RNAi") . This is a 

25 technique for post transcriptional gene silencing 

("PTGS"), in which target gene activity is specifically 
abolished with cognate double- stranded RNA ( "dsRNA" ) . 
RNA ± resembles in many aspects PTGS in plants and has 
been detected in many invertebrates including 

3 0 trypanosome, hydra, planaria, nematode and fruit fly 
(Drosophila melanogaster) . It may be involved in the 
modulation of transposable element mobilization and 
antiviral state formation. RNAi in mammalian systems is 
disclosed in PCT application WO 00/633 64 which is 
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incorporated by reference herein in its entirety. 
Basically, dsRNA of at least about 600 nucleotides, 
homologous to the target marker is introduced into the 
cell and a sequence specific reduction in gene activity 
5 is observed. See generally, Ui-Teia, K. et al. FEBS 
Letters 479: 79-82. 

[0109] In yet another embodiment, the nucleic acid 
molecules of the present invention can be modified at 

10 the base moiety, sugar moiety or phosphate backbone to 
improve, e.g., the stability, hybridization, or 
solubility of the molecule. For example, the 
deoxyribose phosphate backbone of the nucleic acid 
molecules can be modified to generate peptide nucleic 

15 acids (see Hyrup B. et al. (1996) Bioorganic & 

Medicinal Chemistry 4 (1): 5 23). As used herein, the 
terms "peptide nucleic acids" or "PNAs" refer to 
nucleic acid mimics, e.g., DNA mimics, in which the 
deoxyribose phosphate backbone is replaced by a 

2 0 pseudopeptide backbone and only the four natural 

nucleobases are retained. The neutral backbone of PNAs 
has been shown to allow for specific hybridization to 
DNA and RNA under conditions of low ionic strength. 
The synthesis of PNA oligomers can be performed using 

25 standard solid phase peptide synthesis protocols as 
described in Hyrup B. et al. (1996) supra; Perry - 
O'Keefe et al. Proc . Natl. Acad. Sci. 93: 14670-675. 

[0110] PNAs can be used in therapeutic and diagnostic 
30 applications. For example, PNAs can be used as 

antisense or antigene agents for sequence-specific 
modulation of gene expression by, for example, inducing 
transcription or translation arrest or inhibiting 
replication. PNAs of the nucleic acid molecules of the 
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invention {e.g., a gene set forth in Tables 1-5 or 7- 
10) can also be used in the analysis of single base 
pair mutations in a gene, {e.g., by PNA-directed PCR 
clamping); as 'artificial restriction enzymes 1 when 
5 used in combination with other enzymes, {e.g., SI 
nucleases (Hyrup B. (1996) supra)); or as probes or 
primers for DNA sequencing or hybridization (Hyrup B. 
et al. (1996) supra; Perry-O 1 Keef e supra). 

10 [0111] In another embodiment, PNAs can be modified, 

(e.g., to enhance their stability or cellular uptake), 
by attaching lipophilic or other helper groups to PNA, 
by the formation of PNA- DNA chimeras, or by the use of 
liposomes or other techniques of drug delivery known. in 

15 the art. For example, PNA -DNA chimeras of the nucleic 
acid molecules of the invention can be generated which 
may combine the advantageous properties of PNA and DNA. 
Such chimeras allow DNA recognition enzymes, {e.g., 
RNAse H and DNA polymerases) , to interact with the DNA 

20 portion while the PNA portion would provide high 

binding affinity and specificity. PNA -DNA chimeras can 
be linked using linkers of appropriate lengths selected 
in terms of base stacking, number of bonds between the 
nucleobases, and orientation (Hyrup B. (19 96) supra) . 

25 The synthesis of PNA -DNA chimeras can be performed as 

described in Hyrup B. (1996) supra and Finn P.J. et al . 
(1996) Nucleic Acids Res. 24 (17): 3357-63. For 
example, a DNA chain can be synthesized on a solid 
support using standard phosphoramidite coupling 

30 chemistry and modified nucleoside analogs, e.g., 5 , -(4- 
methoxytrityl) amino-5 ' -deoxy- thymidine phosphoramidite, 
can be used as a between the PNA and the 5 1 end of DNA 
(Mag, M . et al . (1989) Nucleic Acid Res. 17: 5973-88). 
PNA monomers are then coupled in a stepwise manner to 
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produce a chimeric molecule with a 5' PNA segment and a 
3 1 DNA segment (Finn P.J. et al. (1996) supra). 
Alternatively, chimeric molecules can be synthesized 
with a 5' DNA segment and a 3' PNA segment (Peterser, 
5 K.H. et al. (1975) Bioorganic Med Chem. Lett. 5: 1119- 
11124) . 

[0112] In other embodiments, the oligonucleotide may 
include other appended groups such as peptides {e.g., 

10 for targeting host cell receptors in vivo) , or agents 
facilitating transport across the cell membrane (see, 
e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. 
USA 86:6553-6556; Lemaitre et al . (1987) Pros. Natl. 
Acad Sci. USA 84:648-652; PCT Publication No. 

15 W088/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. W089/10134) . In addition, 
oligonucleotides can be modified with hybridization- 
triggered cleavage agents (See, e.g., Krol et al. 
(1988) Bio -Techniques 6:958-976) or intercalating 

20 agents. (See, e.g., Zon (1988) Pharm. Res. 5:539-549). 
To this end, the oligonucleotide may be conjugated to 
another molecule, (e.g., a peptide, hybridization 
triggered cross- linking agent, transport agent, or 
hybridization-triggered cleavage agent) . Finally, the 

25 oligonucleotide may be detectably labeled, either such 
that the label is detected by the addition of another 
reagent {e.g., a substrate for an enzymatic label), or 
is detectable immediately upon hybridization of the 
nucleotide (e.g., a radioactive label or a fluorescent 

30 label (e.g., a molecular beacon, as described in U.S. 
Patent 5, 876, 930) . 
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II. Isolated Proteins and Antibodies 

[0113] One aspect of the invention pertains to 
isolated marker proteins, and biologically active 
5 portions thereof, as well as polypeptide fragments 
suitable for use as immunogens to raise anti-marker 
protein antibodies. In one embodiment, native marker 
proteins can be isolated from cells or tissue sources 
by an appropriate purification scheme using standard 
10 protein purification techniques. In another 

embodiment, marker proteins are produced by recombinant 
DNA techniques. Alternative to recombinant expression, , 
a marker protein or polypeptide can be synthesized 
chemically using standard peptide synthesis techniques. 

15 

[0114] An "isolated" or "purified" protein or 
biologically active portion thereof is substantially 
free of cellular material or other contaminating 
proteins from the cell or tissue source from which the 

20 marker protein is derived, or substantially free from 
chemical precursors or other chemicals when chemically 
synthesized. The language "substantially free of 
cellular material" includes preparations of marker 
protein in which the protein is separated from cellular 

25 components of the cells from which it is isolated or 
recombinant ly produced. In one embodiment, the 
language "substantially free of cellular material" 
includes preparations of marker protein having less 
than about 3 0% (by dry weight) of non-marker protein 

30 (also referred to herein as a "contaminating protein"), 
more preferably less than about 20% of non-marker 
protein, still more preferably less than about 10% of 
non-marker protein, and most preferably less than about 
5% non-marker protein. When the marker protein or 

35 biologically active portion thereof is recombinantly 
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produced, it is also preferably substantially free of 
culture medium, i.e., culture medium represents less 
than about 2 0%, more preferably less than about 10%, 
and most preferably less than about 5% of the volume of 
5 the protein preparation. 

[0115] The language n substantially free of chemical 
precursors or other chemicals" includes preparations of 
marker protein in which the protein is separated from 

10 chemical precursors or other chemicals which are 
involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of 
chemical precursors or other chemicals" includes 
preparations of protein having less than about 30% (by 

15 dry weight) of chemical precursors or non-protein 

chemicals, more preferably less than about 20% chemical 
precursors or non-protein chemicals, still more 
preferably less than about 10% chemical precursors or 
non-protein chemicals, and most preferably less than 

20 about 5% chemical precursors or non-protein chemicals. 

[01161 As used herein, a "biologically active portion" 
of a marker protein includes a fragment of a marker 
protein comprising amino acid sequences sufficiently 

25 homologous to or derived from the amino acid sequence 
of the marker protein, which include fewer amino acids 
than the full length marker proteins, and exhibit at 
least one activity of a marker protein. Typically, 
biologically active portions comprise a domain or motif 

30 with at least one activity of the marker protein. A 
biologically active portion of a marker protein can be 
a polypeptide which is, for example, 10, 25, 50, 100, 
200 or more amino acids in length. Biologically active 
portions of a marker protein can be used as targets for 
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developing agents which modulate a marker protein- 
mediated activity. 

[0117] In a preferred embodiment, marker protein is 
5 encoded by a gene set forth in Tables 1-5 or 7-10. In 
other embodiments, the marker protein is substantially 
homologous to a marker protein encoded by a gene set 
forth in Tables 1-5 or 7-10, and retains the functional 
activity of the marker protein, yet differs in amino 

10 acid sequence due to natural allelic variation or 
mutagenesis, as described in detail in subsection I 
above. Accordingly, in another embodiment, the marker 
protein is a protein which comprises an amino acid 
sequence at least about 60%, 65%, 70%, 75%, 80%, 85%, 

15 90%, 95%, 98% or more homologous to the amino acid ' 
sequence encoded by a gene set forth in Tables 1-5 or 
7-10. 

[0118] To determine the percent identity of two amino 
20 acid sequences or of two nucleic acid sequences, the 
sequences are aligned for optimal comparison purposes 

gaps can be introduced in one or both of a first 
and a second amino acid or nucleic acid sequence for 
optimal alignment and non- homologous sequences can be 

2 5 disregarded for comparison purposes) . In a preferred 

embodiment, the length of a reference sequence aligned 
for comparison purposes is at least 30%, preferably at 
least 40%, more preferably at least 50%, even more 
preferably at least 60%, and even more preferably at 
30 least 70%, 80%, or 90% of the length of the reference 
sequence. The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide 
positions are then compared. When a position in the 
first sequence is occupied by the same amino acid 

3 5 residue or nucleotide as the corresponding position in 
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the second sequence, then the molecules are identical 
at that position (as used herein amino acid or nucleic 
acid "identity" is equivalent to amino acid or nucleic 
acid "homology") . The percent identity between the two 
5 sequences is a function of the number of identical 

positions shared by the sequences, taking into account 
the number of gaps, and the length of each gap, which 
need to be introduced for optimal alignment of the two 
sequences . 

10 

[0119] The comparison of sequences and determination 
of percent identity between two sequences can be 
accomplished using a mathematical algorithm. In a 
preferred embodiment, the percent identity between two 

15 amino acid sequences is determined using the Needleman 
and Wunsch (J". Mot. Biol. (48) :444-453 (1970)) 
algorithm which has been incorporated into the GAP 
program in the GCG software package (available at 
http;/ /www. gcg.com) , using either a Blossom 62 matrix 

20 or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 
8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. 
In yet another preferred embodiment, the percent 
identity between two nucleotide sequences is determined 
using the GAP program in the GCG software package 

25 (available at http://www.gcg.com), using a 

NWSgapdna . CMP matrix and a gap weight of 40, 50, 60, 
70, or 8 0 and a length weight of 1, 2, 3, 4, 5, or 6 . 
In another embodiment, the percent identity between two 
amino acid or nucleotide sequences is determined using 

30 the algorithm of E. Meyers and W. Miller (CABIOS, 4:11- 
17 (1989) ) which has been incorporated into the ALIGN 
program (version 2.0), using a PAM 12 0 weight residue 
table, a gap length penalty of 12 and a gap penalty of 
4 . 



35 
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[012 0] The nucleic acid and protein sequences of the 
present invention can further be used as a "query 
sequence" to perform a search against public databases 
to, for example, identify other family members or 
5 related sequences. Such searches can be performed 
using the NBLAST and XBLAST programs (version 2.0) of 
Altschul, et al- (1990) J". Mol. Biol. 215:403-10. 
BLAST nucleotide searches can be performed with the 
NBLAST program, score = 10 0, wordlength = 12 to obtain 

10 nucleotide sequences homologous to nucleic acid 

molecules of the invention. BLAST protein searches can 
be performed with the XBLAST program, score = 50, 
wordlength = 3 to obtain amino acid sequences 
homologous to marker protein molecules of the 

15 invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in 
Altschul efc al., (1997) Nucleic Acids Res. 25(17) :3389- 
3402. When utilizing BLAST and Gapped BLAST programs, 
the default parameters of the respective programs 

20 (e.g., XBLAST and NBLAST) can be used. See 
ht tp : / /www . ncbi . nim . nih . gov . 

[0121] The invention also provides chimeric or fusion 
marker proteins. As used herein, a marker "chimeric 
25 protein" or "fusion protein" comprises a marker 
polypeptide operatively linked to a non-marker 
polypeptide. An "marker polypeptide" includes a 
polypeptide having an amino acid sequence encoded by a 
gene set forth in Tables 1-5 or 7-10, whereas a "non- 
30 marker polypeptide" includes. a polypeptide having an 

amino acid sequence corresponding to a protein which is 
not substantially homologous to the marker protein, 
e.g., a protein which is different from marker protein 
and which is derived from the same or a different 
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organism. Within a marker fusion protein the 
polypeptide can correspond to all or a portion of a 
marker protein. In a preferred embodiment, a marker 
fusion protein comprises at least one biologically 
5 active portion of a marker protein. Within the fusion 
protein, the term "operatively linked" is intended to 
indicate that the marker polypeptide and the non-marker 
polypeptide are fused in-frame to each other. The non- 
marker polypeptide can be fused to the N- terminus or C- 
10 terminus of the marker polypeptide . 

[0122] For example, in one embodiment, the fusion 
protein is a GST-marker fusion protein in which the 
marker sequences are fused to the C-terminus of the GST 
15 sequences. Such fusion proteins can facilitate the 
purification of recombinant marker proteins. 

[0123] In another embodiment, the fusion protein is a 
marker protein containing a heterologous signal 
20 sequence at its N-terminus. In certain host cells 
{e.g., mammalian host cells), expression and/or 
secretion of marker proteins can be increased through 
use of a heterologous signal sequence. Such signal 
sequences are well known in the art. 

25 

[0124] The marker fusion proteins of the invention can 
be incorporated into pharmaceutical compositions and 
administered to a subject in vivo, as described herein. 
The marker fusion proteins can be used to affect the 
30 bioavailability of a marker protein substrate. Use of 
marker fusion proteins may be useful therapeutically 
for the treatment of disorders (e.g., multiple 
sclerosis) caused by, for example, (i) aberrant 
modification or mutation of a gene encoding a marker 
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protein; (ii) mis -regulation of the marker protein- 
encoding gene; and (iii) aberrant post- translational 
modification of a marker protein. 

5 [0125] Moreover, the marker -fusion proteins of the 
invention can be used as immunogens to produce anti- 
marker protein antibodies in a subject , to purify 
marker protein ligands and in screening assays to 
identify molecules which inhibit the interaction of a 
10 marker protein with a marker protein substrate. 

[0126] Preferably, a marker chimeric or fusion protein 
of the invention is produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for 

15 the different polypeptide sequences are ligated 
together in- frame in accordance with conventional 
techniques, for example by employing blunt-ended or 
stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling - 

2 0 in of cohesive ends as appropriate, alkaline 

phosphatase treatment to avoid undesirable joining, and 
enzymatic ligation. In another embodiment, the fusion 
gene can be synthesized by conventional techniques 
including automated DNA synthesizers. Alternatively, 

2 5 PCR amplification of gene fragments can be carried out 

using anchor primers which give rise to complementary 
overhangs between two consecutive gene fragments which 
can subsequently be annealed and reamplified to 
generate a chimeric gene sequence (see, for example, 
30 Current Protocols In Molecular Biology, * eds . Ausubel 
et al. John Wiley & Sons: 1992). Moreover, many 
expression vectors are commercially available that 
already encode a fusion moiety (e.g., a GST 
polypeptide) . A marker protein- encoding nucleic acid 

3 5 can be cloned into such an expression vector such that 



WO 02/079218 



PCT7US02/09305 



- 62 - 

the fusion moiety is linked in- frame to the marker 
protein. 

[0127] A signal sequence can be used to facilitate 
5 secretion and isolation of the secreted protein or 
other proteins of interest. Signal sequences are 
typically characterized by a core of hydrophobic amino 
acids which are generally cleaved from the mature 
protein during secretion in one or more cleavage 

10 events. Such signal peptides contain processing sites 
that allow cleavage of the signal sequence from the 
mature proteins as they pass through the secretory 
pathway. Thus, the invention pertains to the described 
polypeptides having a signal sequence, as well as to 

15 polypeptides from which the signal sequence has been 
proteolytically cleaved (i.e., the cleavage products). 
In one embodiment, a nucleic acid sequence encoding a 
signal sequence can be operably linked in an expression 
vector to a protein of interest, such as a protein 

2 0 which is ordinarily not secreted or is otherwise 
difficult to isolate. The signal sequence directs 
secretion of the protein, such as from a eukaryotic 
host into which the expression vector is transformed, 
and the signal sequence is subsequently or concurrently 

25 cleaved. The protein can then be readily purified from 
the extracellular medium by art recognized methods. 

[0128] Alternatively, the signal sequence can be 
linked to the protein of interest using a sequence 
30 which facilitates purification, such as with a GST 
domain . 

[0129] The present invention also pertains to variants 
of the marker proteins of the invention which function 
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as either agonists (mimetics) or as antagonists to the 
marker proteins. Variants of the marker proteins can 
be generated by mutagenesis, e.g., discrete point 
mutation or truncation of a marker protein. An agonist 
5 of the marker proteins can retain substantially the 

same, or a subset, of the biological activities of the 
naturally occurring form of a marker protein. An 
antagonist of a marker protein can inhibit one or more 
of the activities of the naturally occurring form of 

10 the marker protein by, for example, competitively 
modulating an activity of a marker protein. Thus, 
specific biological effects can be elicited by 
treatment with a variant of limited function. In one 
embodiment, treatment of a subject with a variant 

15 having a subset of the biological activities of the 

naturally occurring forth of the protein has fewer side 
effects in a subject relative to treatment with the 
naturally occurring form of the marker protein. 

20 [0130] Variants of a marker protein which function as 
either marker protein agonists (mimetics) or as marker 
protein antagonists can be identified by screening 
combinatorial libraries of mutants, e.g., truncation 
mutants, of a marker protein for marker protein agonist 

25 or antagonist activity. In one embodiment, a 

variegated library of marker protein variants is 
generated by combinatorial mutagenesis at the nucleic 
acid level and is encoded by a variegated gene library. 
A variegated library of marker protein variants can be 

3 0 produced by, for example, enzymatically ligating a 
mixture of synthetic oligonucleotides into gene 
sequences such that a degenerate set of potential 
marker protein sequences is expressible as individual 
polypeptides, or alternatively, as a set of larger 

35 fusion proteins (e.g., for phage display) containing 



WO 02/079218 



PCT/US02/09305 



- 64 - 

the set of marker protein sequences therein. There are 
a variety of methods which can be used to produce 
libraries of potential marker protein variants from a 
degenerate oligonucleotide sequence. Chemical 
5 synthesis of a degenerate gene sequence can be 

performed in an automatic DNA synthesizer , and the 
synthetic gene then ligated into an appropriate 
expression vector. Use of a degenerate set of genes 
allows for the provision, in one mixture, of all of the 

10 sequences encoding the desired set of potential marker 
protein sequences. Methods for synthesizing degenerate 
oligonucleotides are known in the art (see, e.g., 
Narang, S.A. (1983) Tetrahedron 39:3; Itakura et al. 
(1984) Annu. .Rev. Biochem. 53:323; Itakura et al. 

15 (1984) Science 198:1055; Ike et al . (1983) Nucleic Acid 
Res, 11:477) . 

[0131] In addition, libraries of fragments of a 
protein coding sequence corresponding to a marker 

20 protein of the invention can be used to generate a 

variegated population of marker protein fragments for 
screening and subsequent selection of variants of a 
marker protein. In one embodiment, a library of coding 
sequence fragments can be generated by treating a 

25 double stranded PCR fragment of a marker protein coding 
sequence with a nuclease under conditions wherein 
nicking occurs only about once per molecule, denaturing 
the double stranded DNA, renaturing the DNA to form 
double stranded DNA which can include sense/antisense 

3 0 pairs from different nicked products, removing single 
stranded portions from reformed duplexes by treatment 
with SI nuclease, and ligating the resulting fragment 
library into an expression vector. By this method, an 
expression library can be derived which encodes N- 
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terminal, C- terminal and internal fragments of various 
sizes of the marker protein. 

[0132] Several techniques are known in the art for 
5 screening gene products of combinatorial libraries made 
by point mutations or truncation, and for screening 
cDNA libraries for gene products having a selected 
property. The most widely used techniques, which are 
amenable to high- throughput analysis, for screening 

10 large gene libraries typically include cloning the gene 
library into replicable expression vectors, 
transforming appropriate cells with the resulting 
library of vectors, and expressing the combinatorial 
genes under conditions in which detection of a desired 

15 activity facilitates isolation of the vector encoding 
the gene whose product was detected. Recursive 
ensemble mutagenesis (REM)., a new technique which 
enhances the frequency of functional mutants in the 
libraries, can be used in combination with the 

2 0 screening assays to identify marker variants (Arkin and 

Yourvan (1992) Proc . Natl. Acad. Sci. USA 89:7811-7815; 
Delgrave et al. (1993) Protein Engineering 6(3):327- 
331) . 

25 [0133] An isolated marker protein, or a portion or 
fragment thereof, can be used as an immunogen to 
generate antibodies that bind marker proteins using 
standard techniques for polyclonal and monoclonal 
antibody preparation. A full-length marker protein can 

30 be used or, alternatively, the invention provides 

antigenic peptide fragments of these proteins for use 
as immunogens . The antigenic peptide of a marker 
protein comprises at least 8 amino acid residues of an 
amino acid sequence encoded by a gene set forth in 

3 5 Tables 1-5 or 7-10, and encompasses an epitope of a 
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marker protein such that an antibody raised against the 
peptide forms a specific immune complex with the marker 
protein. Preferably, the antigenic peptide comprises 
at least 10 amino acid residues, more preferably at 
5 least 15 amino acid residues, even more preferably at 
least 2 0 amino acid residues, and most preferably at 
least 30 amino acid residues. 

[0134] Preferred epitopes encompassed by the antigenic 
10 peptide are regions of the marker protein that are 
located on the surface of the protein, e.g-., 
hydrophilic regions, as well as regions with high 
antigenicity. 

15 [0135] A marker protein immundgen typically is used to 
prepare antibodies by immunizing a suitable subject, 
(e.g., rabbit, goat, mouse or other mammal) with the 
imraunogen. An appropriate immunogenic preparation can 
contain, for example, recombinantly expressed marker 

2 0 protein or a chemically synthesized marker polypeptide. 

The preparation can further include an adjuvant, such 
as Freund's complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable 
subject with an immunogenic marker protein preparation 
25 induces a polyclonal anti-marker protein antibody 
response . 

[0136] Accordingly, another aspect of the invention 
pertains to anti-marker protein antibodies. The term 

3 0 "antibody" as used herein includes immunoglobulin 

molecules and immunologically active portions of 
immunoglobulin molecules, i.e., molecules that contain 
an antigen binding site which specifically binds 
(immunoreacts with)' an antigen, such as a marker 
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protein. Examples of immunologically active portions 
of immunoglobulin molecules include F(ab) and Ffab'Jj 
fragments which can be generated by treating the 
antibody with an enzyme such as pepsin. The invention 
5 provides polyclonal and monoclonal antibodies that bind 
to marker proteins. The term "monoclonal antibody" or 
"monoclonal antibody composition", as used herein, 
includes a population of antibody molecules that 
contain only one species of an antigen binding site 
10 capable of immunoreacting with a particular epitope. A 
monoclonal antibody composition thus typically displays 
a single binding affinity for a particular marker 
protein with which it immunoreacts . 

15 [0137J Polyclonal anti-marker protein antibodies can 
be prepared as described above by immunizing a suitable 
subject with a marker protein of the invention. The 
ant i -marker protein antibody titer in the immunized 
subject can be monitored over time by standard 

2 0 techniques, such as with an enzyme linked immunosorbent 

assay (ELISA) using immobilized marker protein. If 
desired, the antibody molecules directed against marker 
proteins can be isolated from the mammal (e.g., from 
the blood) and further purified by well known 
25 techniques, such as protein A chromatography, to obtain 
the IgG fraction. At an appropriate time after 
immunization, e.g., when the anti-marker protein 
antibody titers are highest, antibody-producing cells 
can be obtained. from the subject and used to prepare 

3 0 monoclonal antibodies by standard techniques, such as 

the hybridoma technique originally described by Kohler 
and Milstein (1975) Mature 256:495-497) (see also, 
Brown et al . (1981) J". Immunol. 127:539-46; Brown et 
al. (1980) J\ Biol. Chem. 255:4980-83; Yeh et al. 
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(1976) Proc. Natl. Acad, Sci. USA 76:2927-31; and Yeh 
et al. (1982) Int. CT. Cancer 29:269-75), the more 
recent human B cell hybridoma technique (Kozbor et al. 
(1983) Immunol Today 4:72), the EBV-hybridoma technique 
5 (Cole et al. (1985), Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma 
techniques. The technology for producing monoclonal 
antibody hybridomas is well known (see generally R. H. 
Kenneth, in Monoclonal Antibodies: A New Dimension In 

10 Biological Analyses, Plenum Publishing Corp., New York, 
New York (1980); E. A. Lerner (1981) Yale J*. Biol. 
Med., 54:387-402; M . I* . Gefter et al. (1977) Somatic 
Cell Genet. 3:231-36). Briefly, an immortal cell line 
(typically a myeloma) is fused to lymphocytes 

15 (typically splenocytes) from a mammal immunized with a 
marker protein immunogen as described above, and the 
culture supernatants of the resulting hybridoma cells 
are screened to identify a hybridoma producing a 
monoclonal antibody that binds to a marker protein of 

2 0 the invention. 

[0138] Any of the many well known protocols used for 
fusing lymphocytes and immortalized cell lines can be 
applied for the purpose of generating an anti-marker 
25 protein monoclonal antibody (see, e.g., G. Galfre et 

al. (1977) Nature 266:SSOS2; Gefter et al. Somatic Cell 
Genet., cited supra; Letter, Yale J. Biol. Med., cited 
supra-, Kenneth, Monoclonal Antibodies, cited supra) . 
Moreover, the ordinarily skilled worker will appreciate 

3 0 that there are many variations of such methods which 

also would be useful. Typically, the immortal cell 
line [e.g., a myeloma cell line) is derived from the 
same mammalian species as the lymphocytes. For 
example, murine hybridomas can be made by fusing 
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lymphocytes from a mouse immunized with an immunogenic 
preparation of the present invention with an 
immortalized mouse cell line. Preferred immortal cell 
lines are mouse myeloma cell lines that are sensitive 
5 to culture medium containing hypoxanthine, axninopterin 
and thymidine ("HAT medium") . Any of a number of 
myeloma cell lines can be used as a fusion partner 
according to standard techniques, e.g., the P3-NS1/1- 
Ag4-1, P3-x63-Ag8.653 or Sp210-Agl4 myeloma lines. 

10 These myeloma lines are available from ATCC\ 

Typically, HAT-sensitive mouse myeloma cells are fused 
to mouse splenocytes using polyethylene glycol ("PEG"). 
Hybridoma cells resulting from the fusion are then 
selected using HAT medium, which kills unfused and 

15 unproductively fused myeloma cells (unfused splenocytes 
die after several days because they are not 
transformed) . Hybridoma cells producing a monoclonal 
antibody of the invention are detected by screening the 
hybridoma culture supernatants for antibodies that bind 

20 to a marker protein, e.g., using a standard ELISA 
assay. 

[013 9] Alternative to preparing monoclonal antibody- 
secreting hybridomas, a monoclonal anti-marker protein 

25 antibody can be identified and isolated by screening a 
recombinant combinatorial immunoglobulin library (e.g., 
an antibody phase display library) with marker protein 
to thereby isolate immunoglobulin library members that 
bind to a marker protein. Kits for generating and 

3 0 screening phage display libraries are commercially 
available {e.g., the Pharmacia Recombinant Phage 
Antibody System, Catalog No. 27-9400-01; and the 
Stratagene Surf ZAP™ Phage Display Kit, Catalog No. 
240612) . Additionally, examples of methods and 
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reagents particularly amenable for use in generating 
and screening antibody display library can be found in, 
for example, Ladner et al. U.S. Patent No. 5,223,409; 
Kang et al. PCT International Publication No. "WO 
5 92/18619; Dower et al. PCT International Publication 
No. WO 91/17271; Winter et al. PCT International 
Publication WO 92/20791; Markland et al. PCT 
International Publication No. WO 92115679; Breitling et 
al. PCT International Publication WO 93/012 88; 

10 McCafferty et al. PCT International Publication No. WO 
92/01047; Garrard et al. PCT International Publication 
No. WO 92/09690; Ladner et al. PCT International 
Publication No. WO 90/02809; Fuchs et al. (1991) 
Bio/Technology 9:1370-1372; Hay et al. (1992) Hum. 

15 AntiJbod Hybridomas 3:81-85; Huse et al. (1989) Science 
246:1275-1281; Griffiths et al . (1993) SMBO JT 12:725- 
734; Hawkins et al. (1992) J. Mol. Biol. 226:889-896; 
Clarkson et al. (1991) Nature 352:624-628; Gram et 
al..(1992) Proc. i\Tatl. Acad £?ci. CTSA 89:3576-3580; 

20 Garrad et al. (1991) Bio/Technology 9:1373-1377; 

Hoogenbooiri et al. (1991) Nuc. Acid .Res. 19:4133-4137; 
Barbas et al. (1991) Proc. Natl. Acad. Sci. USA 
88:7978-7982; and McCafferty et al. Nature (1990) 
348:552-554. 

25 

[0140] Additionally, recombinant anti-marker protein 
antibodies, such as chimeric and humanized monoclonal 
antibodies, comprising both human and non-human 
portions, which can be made using standard recombinant 
' 30 DNA techniques, are within the scope of the invention. 
Such chimeric and humanized monoclonal antibodies can 
be produced by recombinant DNA techniques known in the 
art, for example using methods described in Robinson et 
al. International Application No. PCT/US86/02269 ; 



WO 02/079218 



PCT/US02/09305 



- 71 - 

Akira, et al. European Patent Application 184,187; 

Taniguchi, M., European Patent Application 171,496; 

Morrison et al . European Patent Application 173,494? 

Neuberger et al. PCT International Publication No. 
5 WO 86/01533;. Cabilly et al. U.S. Patent No. 4,816,567; 

Cabilly et al. European Patent Application 125,023; 

Better et al. (1988) Science 240:1041-1043; Liu et al. 

(1987) Proc. Natl. Acad Sci . USA 84:3439-3443; Liu et 

al. (1987) J\ Jinmunol. 139:3521 3526; Sun et al . (1987) 
10 Proc, Natl. Acad. Sci. USA 84 -.214-218; Nishimura et al. 

(1987) Cane. Res. 47:999-1005; Wood et al. (1985) 

Nature 314:446-449; and Shaw et al . (1988) iJ. Natl. 

Cancer Inst. 80:1553-1559); Morrison, S.L. (1985) 

Science 229:1202-1207; Oi et al . (1986) BioTechniques 
15 4:214; Winter U.S. Patent 5,225,539; Jones et al . 

(1986) Nature 321:552-525; Verhoeyan et al. (1988) 

Science 239:1534; and Beidler et al. (1988) CT. Immunol. 

141:4053-4060. 

20 [0141] Humanized antibodies are particularly desirable 
for therapeutic treatment of human subjects. Humanized 
forms of non-human (e.g. murine) antibodies are 
chimeric molecules of immunoglobulins, immunoglobulin 
chains or fragments thereof (such as Fv, Fab, Fab' , 

25 F(ab') 2 or other antigen-binding subsequences of 

antibodies) which contain minimal sequence derived from 
non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which 
residues forming a complementary determining region 

3 0 (CDR) of the recipient are replaced by residues from a 
CDR of a non-human species (donor antibody) such as 
mouse, rat or rabbit having the desired specificity, 
affinity and capacity. In some instances, Fv framework 
residues of the human immunoglobulin are replaced by 
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corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in 
the recipient antibody nor in the imported CDR or 
framework sequences. In general, the humanized 
5 antibody will comprise substantially all of at least 
one, and typically two, variable domains, in which all 
or substantially all of the CDR regions correspond to 
those of a non-human immmunoglobulin and all or 
substantially all of the constant regions being those 
10 of a human immunoglobulin consensus sequence. The 
humanized antibody will preferably also comprise at 
least a portion of an immunoglobulin constant region 
(Fc) , typically that of a human immunoglobulin (Jones 
et al. Mature 321: 522-525 (1986); Riechmann et al, 
15 Nature 323: 323-329 (1988); and Presta 
Curr. Op. Struct .Biol. 2: 594-596 (1992). 

[0142J Such humanized antibodies can be produced using 
transgenic mice which are incapable of expressing 
endogenous immunoglobulin heavy and light chains genes, 
but which can express human heavy and light chain 
genes. The transgenic mice are immunized in the normal 
fashion with a selected antigen, e.g., all or a portion 
of a polypeptide corresponding to a marker of the 
invention. Monoclonal antibodies directed against the 
antigen can be obtained using conventional hybridoma 
technology. The human immunoglobulin transgenes 
harbored by the transgenic mice rearrange during B cell 
differentiation, and subsequently undergo class 
switching and somatic mutation. Thus, using such a 
technique, it is possible to produce therapeutically 
useful IgG, igA and IgE antibodies. For an overview of 
this technology for producing humanized antibodies, see 
Lonberg and Huszar (1995) Int. Rev. Immunol. 13:65-93). 
For a detailed discussion of this technology for 



WO 02/079218 



PCTYUS02/09305 



- 73 - 

producing humanized antibodies and humanized monoclonal 
antibodies and protocols for producing such antibodies, 
see, e.g., U.S. Patent 5,625,126; U.S. Patent 
5,633,425; U.S. Patent 5,569,825; U.S. Patent 
5 5,661,016; and U.S. Patent 5,545,8 06. In addition, 
companies such as Abgenix, Inc. (Freemont, CA) , can be 
engaged to provide humanized antibodies directed 
against a selected antigen using technology similar to 
that described above. 

10 

[0143] Humanized antibodies which recognize a selected 
epitope can be generated using a technique referred to 
as "guided selection." In this approach a selected 
non-human monoclonal antibody, e.g., a murine antibody, 
15 is used to guide the selection of a humanized antibody 
recognizing the same epitope (Jespers et al., 1994, 
Bio/ technology 12:899-903) . 

[0144] An anti-marker protein antibody (e.g., 
20 monoclonal antibody) can be used to isolate a marker 
protein of the invention by standard techniques, such 
as affinity chromatography or immunoprecipitation. An 
anti-marker protein antibody can facilitate the 
purification of natural marker proteins from cells and 
25 of recombinantly produced marker proteins expressed in 
host cells. Moreover, an anti-marker protein antibody 
can be used to detect marker protein [e.g., in a 
cellular lysate or cell supernatant) in order to 
evaluate the abundance and pattern of expression of the 
30 marker protein. Anti-marker protein antibodies can be 
used diagnostically to monitor protein levels in tissue 
as part of a clinical testing procedure, e.g., to, for 
example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling 
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(i.e., physically linking) the antibody to a detectable 
substance. Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent 
materials, luminescent materials, bioluminescent 
5 materials, and radioactive materials. Examples of 
suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and 
10 avidin/biotin; examples of suitable fluorescent 
materials include umbellif erone, fluorescein, 
fluorescein isothiocyanate , rhodamine, 

dichlorotriazinyl amine fluorescein, dansyl chloride or 
phycoerythrin; an example of a luminescent material 
15 includes luminol; examples of bioluminescent materials 
include lucif erase, luciferin, and aequorin, and 
examples of suitable radioactive material include 125 I, 
131 I, 35 S or 3 H. 

20 III. Recombinant Expression Vectors and Host Cells 

[0145] Another aspect of the invention pertains to 
vectors, preferably expression vectors, containing a 
nucleic acid encoding a marker protein of the invention 

25 (or a portion thereof) . As used herein, the term 

"vector" includes a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been 
linked. One type of vector is a "plasmid" , which 
includes a circular double stranded DNA loop into which 
'3 0 additional DNA segments can be ligated. Another type 
of vector is a viral vector, wherein additional DNA 
segments can be ligated into the viral genome. Certain 
vectors are capable of autonomous replication in a host 
cell into which they are introduced {e.g., bacterial 

35 vectors having a bacterial origin of replication and 
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episomal mammalian vectors). Other vectors {e.g., non- 
episomal mammalian vectors) are integrated into the 
genome of a host cell upon introduction into the host 
cell, and thereby are replicated along with the host 
5 genome. Moreover, certain vectors are capable of 
directing the expression of genes to which they are 
operatively linked. Such vectors are referred to 
herein as "expression vectors". In general, expression 
vectors of utility in recombinant DNA techniques are 

10 often in the form of plasmids. In the present 

specification, "plasmid" and "vector" can be used 
interchangeably as the plasmid is the most commonly 
used form of vector. However, the invention is 
intended to include such other forms of expression 

15 vectors, such as viral vectors (e.g., replication 
defective retroviruses, adenoviruses and adeno- 
associated viruses), which serve equivalent functions. 

[0146] The recombinant expression vectors of the 

2 0 invention comprise a nucleic acid of the invention in a 
form suitable for expression of the nucleic acid in a 
host cell, which means that the recombinant expression 
vectors include one or more regulatory sequences, 
selected on the basis of the host cells to be used for 

2 5 expression, which is operatively linked to the nucleic 
acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to 
mean that the nucleotide sequence of interest is linked 
to the regulatory sequences) in a manner which allows 

30 for expression of the nucleotide sequence {e.g., in an 
In vitro transcription/translation system or in a host 
cell when the vector is introduced into the host cell) . 
The term "regulatory sequence" is intended to include 
promoters, enhancers and other expression control 

35 elements {e.g., polyadenylation signals). Such 
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regulatory sequences are described, for example, in 
Goeddel; Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990). 
Regulatory sequences include those which direct 
5 constitutive expression of a nucleotide sequence in 
many types of host cells and those which direct 
expression of the nucleotide sequence only in certain 
host cells (e.g., tissue-specific regulatory 
sequences) . It will be appreciated by those skilled in 

10 the art that the design of the expression vector can 
depend on such factors as the choice of the host cell 
to be transformed, the level of expression of protein 
desired, and the like. The expression vectors of the 
invention can be introduced into host cells to thereby 

15 produce proteins or peptides, including fusion proteins 
or peptides, encoded by nucleic acids as described 
herein (e.g. , marker proteins, mutant forms of marker 
proteins, fusion proteins, and the like) . 

20 [0147] The recombinant expression vectors of the 
invention can be designed for expression of marker 
proteins in prokaryotic or eukaryotic cells. For 
example, marker proteins can be expressed in bacterial 
cells such as E. coli, insect cells (using baculovirus 

25 expression vectors) yeast cells or mammalian cells. 
Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990) . Alternatively, 
the recombinant expression vector can be transcribed 

30 and translated in vitro, for example using T7 promoter 
regulatory sequences and T7 polymerase. 

[0148] Expression of proteins in prokaryotes is most 
often carried out in E . coli with vectors containing 
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constitutive or inducible promoters directing the 
expression of either fusion or non- fusion proteins. 
Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the 
5 recombinant protein. Such fusion vectors typically 
serve three purposes: 1) to increase expression of 
recombinant protein; 2) to increase the solubility of 
the recombinant protein; and 3) to aid in the 
purification of the recombinant protein by acting as a 

10 ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is 
introduced at the junction of the fusion moiety and the 
recombinant protein to enable separation of the 
recombinant protein from the fusion moiety subsequent . 

15 to purification of the fusion protein. Such enzymes, 
and their cognate recognition sequences, include Factor 
Xa, thrombin and enterokinase . Typical fusion 
expression vectors include pGEX (Pharmacia Biotech Inc; 
Smith, D,B. and Johnson, K.S. (1988) Gene 67:31-40), 

2 0 pMAL (New England Biolabs, Beverly, MA) and pRITS 

(Pharmacia, Piscataway, NJ) which fuse glutathione S 
transferase (GST) , maltose E binding protein, or 
protein A, respectively, to the target recombinant 
protein. 

25 

[0149] Purified fusion proteins can be utilized in 
marker activity assays, (e.g., direct assays or 
competitive assays described in detail below) , or to 
generate antibodies specific for marker proteins, for 
30 example. 



[0150] Examples of suitable inducible non-fusion B. 
coli expression vectors include pTrc (Hmann et a!., 
(1988) Gene 69:301-315) and pET lid (Studier et al • , 
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Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 
host RNA polymerase transcription from a hybrid trp-lac 
5 fusion promoter. Target gene expression from the pET 
lid vector relies on transcription from a T7 gnlO-lac 
fusion promoter mediated by a coexpressed viral RNA 
polymerase (T7 gnl) . This viral polymerase is supplied 
by host strains BI/21(DE3) or HMS174(DE3) from a 
10 resident prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

[0151] One strategy to maximize recombinant protein 
expression in E. coli is to express the protein in a 
host bacteria with an impaired capacity to 
proteolytically cleave the recombinant protein 
(Gottesman, S., Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, California 
(1990) 119-128) . Another strategy is to alter the 
nucleic acid sequence of the nucleic acid to be 
inserted into an expression vector so that the 
individual codons for each amino acid are those 
preferentially utilized in E. coli (Wade et al., (1992) 
Nucleic Acids Res. 20:2111-2118). Such alteration of 
nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 

[0152] In another embodiment, the marker protein 
expression vector is a yeast expression vector. 
3 0 Examples of vectors for expression in yeast S. 

cerevisiae include pYepSecl (Baldari, et al., (1987) 
Embo J". 6:229-234), pMFa (Kurjan and Herskowitz, (1982) 
Cell 30:933-943), pJRY88 (Schultz et al . , 21987) Gene 



20 
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54:113-123), pYES2 (InVitrogen Corporation, San Diego, 
CA) , and picZ (Invitrogen Corp, San Diego, CA) . 

[0153] Alternatively, marker proteins of the invention 
5 can be expressed in insect cells using baculovirus 

expression vectors. Baculovirus vectors available for 
expression of proteins in cultured insect cells (e.g\, 
Sf 9 cells) include the pAc series (Smith et al. (1983) 
Mol. Cell Biol. 3:2156-2165) and the pVL series 
10 (Lucklow and Summers (1989) Virology 170:31-39). 

[0154] In yet another embodiment, a nucleic acid of 
the invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian 

15 expression vectors include pCDM8 (Seed, B. (1987) 

Mature 329:840) and pMT2PC (Kaufman et al . (1987) EMBO 
J. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often 
provided by viral regulatory elements. For example, 

2 0 commonly used promoters are derived from polyoma, 

Adenovirus 2, cytomegalovirus and Simian Virus 40. For 
other suitable expression systems for both prokaryotic 
and eukaryotic cells see chapters 16 and 17 of 
Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular 

2 5 Cloning: A Laboratory Manual • 2nd, ed. . Cold Spring 

Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 
host RNA polymerase transcription from a hybrid trp-lac 

3 0 fusion promoter. Target gene expression from the pET 

lid vector relies on transcription from a T7 gnlO-lac 
fusion promoter mediated by a coexpressed viral RNA 
polymerase (T7 gnl) . This viral polymerase is supplied 
by host strains BL2 1 (DE3 ) or HMS174(DE3) from a 
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resident prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

[0155] One strategy to maximize recombinant protein 
5 expression in E. coli is to express the protein in a 
host bacteria with an impaired capacity to 
proteolytically cleave the recombinant protein 
(Gottesman, S., Gene Expression Technology: Methods In 
Enzymology 185, Academic Press, San Diego, California 

10 (1990) 119-128) - Another strategy is to alter the 
nucleic acid sequence of the nucleic acid to be 
inserted into an expression vector so that the 
individual codons for each amino acid are those 
preferentially utilized in E* coli (Wada et al., (1992) 

15 Nucleic Acids Res. 20:2111-2118). Such alteration of 
nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 

[0156] In another embodiment, the marker protein 
2 0 expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. 
cerevisiae include pYepSecl (Baldari, et al., (1987) 
Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) 
Cell 30:933-943), pJRY88 (Schultz et al . , (1987) Gene 
25 54:113-123), pYES2 (Invitrogen Corporation, San Diego, 
CA) , and picZ (Invitrogen Corp, San Diego, CA) . 

[0157J Alternatively, marker proteins of the invention 
can be expressed in insect cells using baculovirus 
30 expression vectors. Baculovirus vectors available for 
expression of proteins in cultured insect cells (e.g., 
Sf9 cells) include the pAc series (Smith et al. (1983) 
Mol. Cell Biol. 3:2156-2165) and the pVL series 
(Lucklow and Summers (1989) Virology 170:31-39). 
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[0158] In yet another embodiment, a nucleic acid of 
the invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian 
5 expression vectors include pCDM8 (Seed, B. (1987) 

Mature 329:840) and pMT2PC (Kaufman et al. (1987) EMBO 
J. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often 
provided by viral regulatory elements. For example, 

10 commonly used promoters are derived from polyoma, 

Adenovirus 2, cytomegalovirus and Simian Virus 40. For 
other suitable expression systems for both proJcaryotic 
and eukaryotic cells see chapters 16 and 17 of 
Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular 

15 Cloning: A Laboratory Manual. 2nd, ed, Cold Spring 

Harbor Laboratory, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, 1989. 

[0159] In another embodiment, the recombinant 
20 mammalian expression vector is capable of directing 
expression of the nucleic acid preferentially in a 
particular cell type (e.g., tissue-specific regulatory 
elements are used to express the nucleic acid) . 
Tissue-specific regulatory elements are known in the 
25 art. Non-limiting examples of suitable tissue-specific 
promoters include the albumin promoter (liver-specific; 
Pinkert et al. (1987) Genes Dev. 1:268-277), lymphoid- 
specific promoters (Calame and Eaton (1988) Adv. 
Immunol. 43:235-275), in particular promoters of T cell 
30 receptors (Winoto and Baltimore (1989) EMBO i7. 8:729- 
733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741- 
748), neuron-specific promoters (e.g., the 
neurofilament promoter, Byrne and R.aaddle (1989) Proc . 
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Nail. Acad Sci . USA 86:5473-5477), pancreas-specific 
promoters (Edlund et al . (1985) Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey 
promoter, U.S. Patent No. 4,873,316 and European 
5 Application Publication No. 264,166). Developmentally- 
regulated promoters are also encompassed, for example 
the marine hox promoters (Kessel and Grass (1990) 
Science 249:374-379) and the a-f etoprotein promoter 
(Campes and Tilghman (1989) Genes Dev. 3:537-546). 

10 

[0160] The invention further provides a recombinant 
expression vector comprising a DNA molecule of the 
invention cloned into the expression vector in an 
antisense orientation. That is, the DNA molecule is 

15 operatively linked to a regulatory sequence in a manner 
which allows for expression (by transcription of the 
DNA molecule) of an RNA molecule which is antisense to 
mRNA corresponding to a gene of the invention (e.g., a 
gene set forth in Tables 1-5 or 7-10) . Regulatory 

20 sequences operatively linked to a nucleic acid cloned 
in the antisense orientation can be chosen which direct 
the continuous expression of the antisense RNA molecule 
in a variety of cell types, for instance viral 
promoters and/or enhancers, or regulatory sequences can 

25 be chosen which direct constitutive, tissue specific or 
cell type specific expression of antisense RNA. The 
antisense expression vector can be in the form of a 
recombinant plasmid, phagemid or attenuated virus in 
which antisense nucleic acids are produced under the 

30 control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type 
into which the vector is introduced. For a discussion 
of the regulation of gene expression using antisense 
genes see Weintraub, H. et al., Antisense RNA as a 
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molecular tool for genetic analysis, .Reviews - Trends 
in Genetics, Vol. 1(1)1986. 

[0161] Another aspect of the invention pertains to 
5 host cells into which a nucleic acid molecule of the 
invention is introduced, e.g., a gene set forth in 
Tables 1-5 or 7-10 within a recombinant expression 
vector or a nucleic acid molecule of the invention 
containing sequences which allow it to homologously 

10 recombine into a specific site of the host cell's 

genome. The terms "host cell" and "recombinant host 
cell" are used interchangeably herein. It is 
understood that such terms refer not only to the 
particular subject cell but to the progeny or potential 

15 progeny of such a cell. Because certain modifications 
may occur in succeeding generations due to either 
mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 
still included within the scope of the term as used 

20 herein. 

[0162] A host cell can be any prokaryotic or 
eukaryotic cell. For example, a marker protein of the 
invention can be expressed in bacterial cells such as 
25 E. coli, insect cells, yeast or mammalian cells (such 
as Chinese hamster ovary cells (CHO) or COS cells) . 
Other suitable host cells are known to those skilled in 
the art . 

30 [0163] Vector DNA can be introduced into prokaryotic 
or eukaryotic cells via conventional transformation or 
transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to 
refer to a variety of art-recognized techniques for 
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introducing foreign nucleic acid {e.g., DNA) into a 
host cell, including calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran-mediated 
transf ection, lipofection, or electmporation . Suitable 
5 methods for transforming or transferring host cells can 
be found in Sambrook, et al. {Molecular Cloning; A 
Laboratory Manual. 2nd, ed. , Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989) , and other laboratory manuals. 

10 

[0164] For stable transfection of mammalian cells, it 
is known that, depending upon the expression vector and 
transfection technique used, only a small fraction of 
cells may integrate the foreign DNA into their genome. 

15 In order to identify and select these integrants, a 
gene that encodes a selectable flag {e.g., resistance 
to antibiotics) is generally introduced into the host 
cells along with the gene of interest. Preferred 
selectable flags include those which confer resistance 

2 0 to drugs, such as G418, hygromycin and methotrexate. 
Nucleic acid encoding a selectable flag can be 
introduced into a host cell on the same vector as that 
encoding a marker protein or can be introduced on a 
separate vector. Cells stably transf ected with the 

25 introduced nucleic acid can be identified by drug 
selection {e.g., cells that have incorporated the 
selectable flag gene will survive, while the other 
cells die) . 

30 [0165] A host cell of the invention, such as a 

prokaryotic or eukaryotic host cell in culture, can be 
used to produce (i.e., express) a marker protein. 
Accordingly, the invention further provides methods for 
producing a marker protein using the host cells of the 
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invention. In one embodiment, the method comprises 
culturing the host cell of invention (into which a 
recombinant expression vector encoding a marker protein 
has been introduced) in a suitable medium such that a 
5 marker protein of the invention is produced. In 
another embodiment, the method further comprises 
isolating a marker protein from the medium or the host 
cell . 

10 [0166] The host cells of the invention can also be 
used to produce non-human transgenic animals. For 
example, in one embodiment, a host cell of the 
invention is a fertilized oocyte or an embryonic stem 
cell into which marker-protein-coding sequences have 

15 been introduced. Such host cells can then be used to 
create non-human transgenic animals in which exogenous 
sequences encoding a marker protein of the invention 
have been introduced into their genome or homologous 
recombinant animals in which endogenous sequences 

2 0 encoding the marker proteins of the invention have been 
altered. Such animals are useful for studying the 
function and/or activity of a marker protein and for 
identifying and/or evaluating modulators of marker 
protein activity. As used herein, a "transgenic 

2 5 animal" is a non-human animal, preferably a mammal, 

more preferably a rodent such as a rat or mouse, in 
which one or more of the cells of the animal includes a 
transgene . Other examples of transgenic animals 
include non-human primates, sheep, dogs, cows, goats, 

3 0 chickens, amphibians, and the tike. A transgene is' 

exogenous DNA which is integrated into the genome of a 
cell from which a transgenic animal develops and which 
remains in the genome of the mature animal, thereby 
directing the expression of an encoded gene product in 
35 one or more cell types or tissues of the transgenic 
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animal. As used herein, a "homologous recombinant 
animal" is a non-human animal, preferably a mammal, 
more preferably a mouse, in which an endogenous gene of 
the invention (e.g., a gene set forth in Tables 1-5 or 
5 7-10) has been altered by homologous recombination 
between the endogenous gene and an exogenous DNA 
molecule introduced into a cell of the animal, e.g., an 
embryonic cell of the animal, prior to development of 
the animal . 

10 

[0167] A transgenic animal of the invention can be 
created by introducing a marker- encoding nucleic acid 
into the mate pronuclei of a fertilized oocyte, e.g., 
by microinjection, retroviral infection, and allowing 

15 the oocyte to develop in a pseudopregnant female foster 
animal. Intronic sequences and polyadenylation signals 
can also be included in the transgene to increase the 
efficiency of expression of the transgene. A tissue- 
specific regulatory sequence (s) can be operably linked 

20 to a transgene to direct expression of a marker protein 
to particular cells. Methods for generating transgenic 
animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become 
conventional in the art and are described, for example, 

25 in U.S. Patent Nos . 4,736,866 and 4,870,009, both by 
Leder et al., U.S. Patent No. 4,873,191 by Wagner et 
al. and in Hogan, B., Manipulating the Mouse Embryo, 
(Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y. , 1986). Similar methods are used for 

30 production of other transgenic animals. A transgenic 
founder animal can be identified based upon the 
presence of a transgene of the invention in its genome 
and/or expression of mRNA corresponding to a gene of 
the invention in tissues or cells of the animals. A 

35 transgenic founder animal can then be used to breed 
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additional animals carrying the transgene . Moreover, 
transgenic animals carrying a transgene encoding a 
marker protein can further be bred to other transgenic 
animals carrying other transgenes. 

5 

[0168] To create a homologous recombinant animal, a 
vector is prepared which contains at least a portion of 
a gene of the invention into which a deletion, addition 
or substitution has been introduced to thereby alter, 

10 e.g., functionally disrupt, the gene. The gene can be 
a human gene, but more preferably, is a non-human 
homologue of a human gene of the invention (e.g., a 
gene set forth in Tables 1-5 or 7-10) . For example, a 
mouse gene can be used to construct a homologous 

15 recombination nucleic acid molecule, e.g., a vector, 
suitable far altering an endogenous gene of the 
invention in the mouse genome. In a preferred 
embodiment, the homologous recombination nucleic acid 
molecule is designed such that, upon homologous 

2 0 recombination, the endogenous gene of the invention is 
functionally disrupted (i.e., no longer encodes a 
functional protein; also referred to as a "knock out" 
vector) . Alternatively, the homologous recombination 
nucleic acid molecule can be designed such that, upon 

25 homologous recombination, the endogenous gene is 
mutated or otherwise altered but still encodes 
functional protein {e.g., the upstream regulatory 
region can be altered to thereby alter the expression 
of the endogenous marker protein) . In the homologous 

30 recombination nucleic acid molecule, the altered 

portion of the gene of the invention is flanked at its 
5' and 3* ends by additional nucleic acid sequence of 
the gene of the invention to allow for homologous 
recombination to occur between the exogenous gene 
35 carried by the homologous recombination nucleic acid 
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molecule and an endogenous gene in a cell, e.g., an 
embryonic stem cell. The additional flanking nucleic 
acid sequence is of sufficient length for successful 
homologous recombination with the endogenous gene. 
5 Typically, several kilobases of flanking DNA (both at 
the 5' and 3* ends) are included in the homologous 
recombination nucleic acid molecule (see, e.g., Thomas, 
K.R. and Capecchi, M.R. (1987) Cell 51:503 for a 
description of homologous recombination vectors) . The 
10 homologous recombination nucleic acid molecule is 

introduced into a cell, e.g., an embryonic stem cell 
line (e.g., by electroporation) and cells in which the 
introduced gene has homologously recombined with the 
endogenous gene are selected (see e.g., Li, E. et al. 
15 (1992) Cell 69:915). The selected cells can then 

injected into a blastocyst of an animal (e.g., a mouse) 
to form aggregation chimeras (see e.g. Bradley, S A. in 
Teratocareirtomas and Embryonic Stem Cells: A 
Practical Approach, E.J. Robertson, ed. (IRL, Oxford, 
20 1987) pp. 113-152) . A chimeric embryo can then be 

implanted into a suitable pseudopregnant female foster 
animal and the embryo brought to term. Progeny 
harboring the homologously recombined DNA in their germ 
cells can be used to breed animals in which all cells 
2 5 of the animal contain the homologously recombined DNA 
by germline transmission of the transgene. Methods for 
constructing homologous recombination nucleic acid 
molecules, e.g., vectors, or homologous recombinant 
animals are described further in Bradley, A. (1991) 
30 Current Opinion in Biotechnology 2:823-829 and in PCT 
International Publication Nos. : WO 90/11354 by Le 
Mouellec et al.; WO 91/01140 by Smithies et al.; WO 
92/0968 by Zijlstra et al . ; and WO 93/04169 by Berns et 
al. 



WO 02/079218 



PCT/US02/09305 



[0169] In another embodiment, transgenic non-human 
animals can be produced which contain selected systems 
which allow for regulated expression of the transgene. 
5 One example of such a system is the cre/loxP 
recombinase system of bacteriophage PI. For a 
description of the cre/loxP recombinase system, see, 
e.g., Laksa et al. (1992) Proc. Natl. Acad. Sci. USA 
89:6232-6236. Another example of a recombinase system 

10 is the FLP recombinase system of Saccharomyces 

cerevlsiae (O f Gorman et al. (1991) Science 251:1351- 
1355. If a cre/loxP recombinase system is used to 
regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase 

15 and a selected protein are required. Such animals can 
be provided through the construction of "double" 
transgenic animals, e.g., by mating two transgenic 
animals, one containing a transgene encoding a selected 
protein and the other containing a transgene encoding a 

2 0 recombinase . 

[0170] Clones of the non-human transgenic animals 
described herein can also be produced according to the 
methods described in Wilmut, I. et al* (1997) Mature 

25 385:810-813 and PCT International Publication Nos . WO 
97/07668 and WO 97/07669. In brief, a cell, e.g., a 
somatic cell, from the transgenic animal can be 
isolated and induced to exit the growth cycle and enter 
G Q phase. The quiescent cell can then be fused, e.g., 

30 through the use of electrical pulses, to an enucleated 
oocyte from an animal of the same species from which 
the quiescent cell is isolated. The reconstructed 
oocyte is then cultured such that it develops to morula 
or blastocyte and then transferred to pseudopregnant 
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female foster animal. The offspring borne of this 
female foster animal will be a clone of the animal from 
which the cell, e.g., the somatic cell, is isolated. 

5 IV. Pharmaceutical Compositions 

[0171] The nucleic acid molecules of the invention 
(e.g./ the genes set forth in Tables 1-5 or 7-10), 
fragments of marker proteins, and anti-marker protein 

10 antibodies of the invention can be incorporated into 
pharmaceutical compositions suitable for 
administration. Such compositions (also referred to 
herein as "bioactive agents or compounds") typically 
comprise the nucleic acid molecule, protein, or 

15 antibody and a pharmaceutically acceptable carrier. 

[0172] As used herein the language "pharmaceutically 
acceptable carrier" is intended to include any and all 
solvents, dispersion media, coatings, antibacterial and 

20 antifungal agents, isotonic and absorption delaying 
agents, and the like, compatible with pharmaceutical 
administration. The use of such media and agents for 
pharmaceutically active substances is well-known in the 
art. Except insofar as any conventional media or agent 

25 is incompatible with the active compound, use thereof 
in the compositions is contemplated. Supplementary 
bioactive agents can also be incorporated into the 
compositions . 

30 [0173] The invention includes methods for preparing 
pharmaceutical compositions for modulating the 
expression or activity of a polypeptide or nucleic acid 
corresponding to a marker of the invention. Such 
methods comprise formulating a pharmaceutically 
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acceptable carrier with an agent which modulates 
expression or activity of a polypeptide or nucleic acid 
corresponding to a marker of the invention. Such 
compositions can further include additional active 
5 agents. Thus, the invention further includes methods 
for preparing a pharmaceutical composition by 
formulating a pharmaceutical ly acceptable carrier with 
an agent which modulates expression or activity of a 
polypeptide or nucleic acid corresponding to a marker 
10 of the invention and one or more additional bioactive 
agents . 

[0174] The invention also provides methods (also 
referred to herein as "screening assays") for 

15 identifying modulators, i.e., candidate or test 

compounds or agents comprising therapeutic moieties 
(e.g., peptides, peptidomimetics, peptoids, small 
molecules or other drugs) which (a) bind to the marker, 
or (b) have a modulatory (e.g., stimulatory or 

20 inhibitory) effect on the activity of the marker or, 

more specifically, (c) have a modulatory effect on the 
interactions of the marker with one or more of its 
natural substrates (e.g., peptide, protein, hormone, 
co- factor, or nucleic acid) , or (d) have a modulatory 

25 effect on the expression of the marker. Such assays 
typically comprise a reaction between the marker and 
one or more assay components. The other components may 
be either the test compound itself, or a combination of 
test compound and a natural binding partner of the 

3 0 marker. 

[0175] The test compounds of the present invention may 
be bioactive agents, i.e. protein, oligopeptide, 
molecule, polysaccharide, polynucleotides. In a 
35 preferred embodiment the bioactive agents are proteins, 
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in particular naturally occurring proteins or fragments 
thereof . 

[0176] The test compounds of the present invention may 
5 be obtained from any available source, including 
systematic libraries of natural and/or synthetic 
compounds. Test compounds may also be obtained by any 
of the numerous approaches in combinatorial library 
methods known in the art, including: biological 

10 libraries; peptoid libraries (libraries of molecules 
having the functionalities of peptides, but with a 
novel, non-peptide backbone which are resistant to 
enzymatic degradation but which nevertheless remain 
bioactive; see, e.g., Zuckermann et al., 1994, J". Med. 

15 Chem. 37:2678-85); spatially addressable parallel solid 
phase or solution phase libraries; synthetic library 
methods requiring deconvolution; the 'one-bead one- 
compound 1 library method; and synthetic library methods 
using affinity chromatography selection. The 

2 0 biological library and peptoid library -approaches are 
limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam, 
1997, Anticancer Drug Des . 12:145) . 

25 

[0177] A pharmaceutical composition of the invention 
is formulated to be compatible with its intended route 
of administration. Examples of routes of 
administration include parenteral, e.g., intravenous, 
30 intradermal, subcutaneous, oral (e.g., inhalation), 
transdermal (topical), transmucosal , and rectal 
administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application 
can include the following components: a sterile 
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diluent such as water for injection, saline solution, 
fixed oils, polyethylene glycols, glycerine; propylene 
glycol or other synthetic solvents; antibacterial 
agents such as benzyl alcohol or methyl parabens; 
5 antioxidants such as ascorbic acid or sodium bisulfate; 
chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates 
and agents for the adjustment of tonicity such as 
sodium chloride or dextrose. pH can be adjusted with 
10 acids or bases, such as hydrochloric acid or sodium 

hydroxide. The parenteral preparation can be enclosed 
in ampoules, disposable syringes or multiple dose vials 
made of glass or plastic. 

15 [0178] Pharmaceutical compositions suitable for 

injectable use include sterile aqueous solutions (where 
water soluble) or dispersions and sterile powders for 
the extemporaneous preparation of sterile injectable 
solutions or dispersion. For intravenous 

2 0 administration, suitable carriers include physiological 
saline, bacteriostatic water, Cremophor EL™ (BASF, 
Parsippany, NJ) or phosphate buffered saline (PBS) . In 
all cases, the composition must be sterile and should 
be fluid to the extent that easy syringability exists. 

2 5 It must be stable under the conditions of manufacture 

and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria 
and fungi. The earner can be a solvent or dispersion 
medium containing, for example, water, ethanol, polyol 

3 0 (for example, glycerol, propylene glycol, and liquid 

polyetheylene glycol, and the like) , and suitable 
mixtures thereof. The proper fluidity can be 
maintained, for example, by the use of a coating such 
as lecithin, by the maintenance of the requited 
35 particle size in the case of dispersion and by the use 
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of surfactants. Prevention of the action of 
microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and 
5 the like. In many cases, it will be preferable to 
include isotonic agents, for example, sugars, 
polyalcohols such as manitol, sorbitol, sodium chloride 
in the composition. Prolonged absorption of the 
injectable compositions can be brought about by 
10 including in the composition an agent which delays 
absorption, for example, aluminum monostearate and 
gelatin. 

[01791 Sterile injectable solutions can be prepared by 
15 incorporating the active compound (e.g., a fragment of 
a marker protein or an anti-marker protein antibody) in 
the required amount in an appropriate solvent with one 
or a combination of ingredients enmnerated above, as 
required, followed by filtered sterilization. 
20 Generally, dispersions are prepared by incorporating 
the active compound into a sterile vehicle which 
contains a basic dispersion medium and the required 
other ingredients from those enumerated above. In the 
case of sterile powders for the preparation of sterile 
25 injectable solutions, the preferred methods of 

preparation are vacuum drying and f reeze-drying which 
yields a powder of the active, ingredient plus any 
additional desired ingredient from a previously 
sterile-filtered solution thereof. 

30 

[0180] Oral compositions generally include an inert 
diluent or an edible carrier. They can be enclosed in 
gelatin capsules or compressed into tablets. For the 
purpose of oral therapeutic administration, the active 
35 compound can be incorporated with excipients and used 
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in the form of tablets, troches, or capsules. Oral 
compositions can also be prepared using a fluid carrier 
for use as a mouthwash, wherein the compound in the 
fluid carrier is applied orally and swished and 
5 expectorated or swallowed. Pharmaceutically compatible 
binding agents, and/or adjuvant materials can be 
included as part of the composition. The tablets, 
pills, capsules, troches and the like can contain any 
of the following ingredients, or compounds of a similar 

10 nature: a binder such as microcrystalline cellulose, 
gum tragacanth or gelatin; an excipient such as starch 
or lactose, a disintegrating agent such as alginic 
acid, Primogel, or corn starch; a lubricant such as 
magnesium stearate or Stertes; a glidant such as 

15 colloidal silicon dioxide; a sweetening agent such as 
sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate, or orange flavoring. 

[0181] For administration by inhalation, the compounds 
20 are delivered in the form of an aerosol spray from 
pressured container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon ' 
dioxide, or a nebulizer. 

25 [0182] Systemic administration can also be by 

transmucosal or transdermal means. For transmucosal or 
transdermal administration, penetrants appropriate to 
the barrier to be permeated are used in the 
formulation. Such penetrants are generally known in 

3 0 the art, and include, for example, for transmucosal 
administration, detergents, bile salts, and fusidic 
acid derivatives. Transmucosal administration can be 
accomplished through the use of nasal sprays or 
suppositories. For transdermal administration, the 
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bioactive compounds are formulated into ointments, 
salves, gels, or creams as generally known in the art. 

[0183] The compounds can also be prepared in the form 
5 of suppositories (e.g. , with conventional suppository 
bases such as cocoa butter and other glycerides) or 
retention enemas for rectal delivery. 

[0184] In one embodiment, the therapeutic moieties, 

10 which may contain a bioactive compound, are prepared 
with carriers that will protect the compound against 
rapid elimination from the body, such as a controlled 
release formulation, including implants and 
microencapsulated delivery systems. Biodegradable, 

15 biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides, polyglycolic acid, 
collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be 
apparent to those skilled in the art. The materials 

20 can also be obtained commercially from Alza Corporation 
and Nova Pharmaceuticals , Inc. Liposomal suspensions 
(including liposomes targeted to infected cells with 
monoclonal antibodies to viral antigens) can also be 
used as pharmaceutical^ acceptable carriers. These 

25 can be prepared according to methods known to those 
skilled in the art, for example, as described in U.S. 
Patent No. 4,522,811. 

[0185] It is especially advantageous to formulate oral 
30 or parenteral compositions in dosage unit form for ease 
of administration and uniformity of dosage. Dosage 
unit form as used herein includes physically discrete 
units suited as unitary dosages for the subject to be 
treated; each unit containing a predetermined quantity 
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of active compound calculated to produce the desired 
therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the 
dosage unit forms of the invention are dictated by and 
5 directly dependent on-the unique characteristics of the 
active compound and the particular therapeutic effect 
to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the 
treatment of individuals. 

10 

[0186] Toxicity and therapeutic efficacy of such 
compounds can be determined by standard pharmaceutical 
procedures in cell cultures or experimental animals, 
e.g., for determining the LD50 (the dose lethal to 50% 

15 of the population) and the ED50 (the dose 

therapeutically effective in 50% of the population) . 
The dose ratio between toxic and therapeutic effects is 
the therapeutic index and it can be expressed as the 
ratio LD50/ED50, Compounds which exhibit large 

20 therapeutic indices are preferred. While compounds 
that exhibit toxic side effects may be used, care 
should be taken to design a delivery system that 
targets such compounds to the site of affected tissue 
in order to minimize potential damage to uninfected 

25 cells and, thereby, reduce side effects. 

[0187] The data obtained from the cell culture assays 
and animal studies can be used in formulating a range 
of dosage for use in humans. The dosage of such 

30 compounds lies preferably within a range of circulating 
concentrations that include the ED50 with little or no 
toxicity. The dosage may vary within this range 
depending upon the dosage form employed and the route 
of administration utilized. For any compound used in 

35 the method of the invention, the therapeutically 
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effective dose can be estimated initially from cell 
culture assays. A dose may be formulated in animal 
models to achieve a circulating plasma concentration 
range that includes the IC50 (i.e., the concentration 
5 of the test compound which achieves a half-maximal 

inhibition of symptoms) as determined in cell culture. 
Such information can be used to more accurately 
determine useful doses in humans. Levels in plasma may 
be measured, for example, by high performance liquid 
10 chromatography . 

[0188] The nucleic acid molecules of the invention can 
be inserted into vectors and used as gene therapy 
vectors. Gene therapy vectors can be delivered to a 

15 subject by, for example, intravenous injection, local 
administration (see U.S. Patent 5,328,470) or by 
stereotactic injection (see e.g., Chen et al . (1994) 
Proc. Natl. Acad. Sci. USA 91:3054-3057). The 
pharmaceutical preparation of the gene therapy vector 

2 0 can include the gene therapy vector in an acceptable 

diluent, or can comprise a slow release matrix in which 
the gene delivery vehicle is imbedded. Alternatively, 
where the complete gene delivery vector can be produced 
intact from recombinant cells, e.g., retroviral 

2 5 vectors, the pharmaceutical preparation can include one 
or more cells which produce the gene delivery system. 

[0189] The pharmaceutical compositions can be included 
in a container, pack, or dispenser together with 
30 instructions for administration. 



WO 02/079218 



PCT/US02/09305 



- 99 - 

V. Computer Readable Means and Arrays 

[0190] Computer readable media comprising a marker (s) 
of the present invention is also provided. As used 
5 herein, "computer readable media" includes a medium 
that can be read and accessed directly by a computer. 
Such media include, but are not limited to: magnetic 
storage media, such as floppy discs, hard disc storage 
medium, and magnetic tape; optical storage media such 

10 as CD-ROM; electrical storage media such as RAM and 
ROM; and hybrids of these categories such as 
magnetic/optical storage media. The skilled artisan 
will readily appreciate how any of the presently known 
computer readable mediums can be used to create a 

15 manufacture comprising computer readable medium having 
recorded thereon a marker of the present invention. 

[0191] As used herein, "recorded" includes a process 
for storing information on computer readable medium. 
2 0 Those skilled in the art can readily adopt any of the 
presently known methods for recording information on 
computer readable medium to generate manufactures 
comprising the markers of the present invention. 

25 [0192] A variety of data processor programs and 

formats can be used to store the marker information of 
the present invention on computer readable medium. For 
example, the nucleic acid sequence corresponding to the 
markers can be represented in a word processing text 

30 file, formatted in commercially-available software such 
as WordPerfect and Microsoft Word, or represented in 
the form of an ASCII file, stored in a database 
application, such as DB2 , Sybase, Oracle, or the like. 
Any number of dataprocessor structuring formats (e.g., 
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obtain computer readable medium having recorded thereon 
the markers of the present invention. 

5 [0193] By providing the markers of the invention in 
computer readable form, one can routinely access the 
marker sequence information for a variety of purposes. 
For example, one skilled in the art can use the 
nucleotide or amino acid sequences of the invention in 

10 computer readable form to compare a target sequence or 
target structural motif with the sequence information 
stored within the data storage means. Search means are 
used to identify fragments or regions of the sequences 
of the invention which match a particular target 

15 sequence or target motif. 

[0194] The invention also, includes an array comprising 
a marker (s) of the present invention, i.e. a biochip. 
The array can be used to assay expression of one or 

20 more genes in the array. In one embodiment, the array 
can be used to assay gene expression in a tissue to 
ascertain tissue specificity of genes in the array. In 
this manner, up to about 8600 genes can be 
simultaneously assayed for expression. This allows an 

2 5 expression profile to be developed showing a battery of 
genes specifically expressed in one or more tissues at 
a given point in time. 

[0195] In addition to such qualitative determination, 
30 the invention allows* the quantitation of gene 

expression in the biochip. Thus, not only tissue 
specificity, but also the level of expression of a 
battery of genes in the tissue is ascertainable. Thus, 
genes can be grouped on the basis of their tissue 
35 expression per se and level of expression in that 
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tissue. As used herein, a "normal level of expression" 
refers to the level of expression of a gene provided in 
a control sample, typically the control is from non- 
involved cells or tissues, or from a non-diseased 
5 subject. The determination of normal levels of 

expression is useful, for example, in ascertaining the 
relationship of gene expression between or among 
tissues. Thus, one tissue can be perturbed and the 
effect on gene expression in a second tissue can be 

10 determined. In this context, the effect of one cell 
type on another cell type in response to a biological 
stimulus can be determined. Such a determination is 
useful, for example, to know the effect of cell-cell 
interaction at the level of gene expression. If an 

15 agent is administered therapeutically to treat one cell 
type but has an undesirable effect on another cell 
type, the invention provides an assay to determine the 
molecular basis of the undesirable effect and thus 
provides the opportunity to co- administer a 

20 counteracting agent or otherwise treat the undesired 
effect. Similarly, even within a single cell type, 
undesirable biological effects can be determined at the 
molecular level. Thus, the effects of an agent on 
expression of other than the target gene can be 

25 ascertained and counteracted. 

[0196] In another embodiment, the arrays can be used 
to monitor the time course of expression of one or more 
genes in the array. This can occur in various 

30 biological contexts, as disclosed herein, for example 
development and differentiation, disease progression, 
in vitro processes, such a cellular transformation and 
senescence, autonomic neural and neurological 
processes, such as, for example, pain and appetite, and 

35 cognitive functions, such as learning or memory. 
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10197] The array is also useful for ascertaining the 
effect of the expression of a gene on the expression of 
other genes in the same cell or in different cells. 
5 This provides, for example, for a selection of 
alternate molecular targets for therapeutic 
intervention if the ultimate or downstream target 
cannot be regulated. 

10 [0198] The array is also useful for ascertaining 

differential expression patterns of one or more genes 
in non- involved or diseased cells. This provides a 
battery of genes that could serve as a molecular target 
for diagnosis or therapeutic intervention. In 

15 particular, biochips can be made comprising arrays not 
only of the differentially expressed markers listed in 
Tables 1-5 or 7-10, but of markers specific to subjects 
at a certain stage of the disease (i.e. secondary 
progressive, primary progressive, relapsing-remitting) , 

20 or from higher-risk geographical regions (i.e. northern 
Europe or Canada) . 

VI. Predictive Medicine 

25 [0199] The present invention pertains to the field of 
predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenetics and monitoring 
clinical trials are used for prognostic (predictive) 
purposes to thereby treat an individual 

30 prophylactically. Accordingly, one aspect of the 
present invention relates to diagnostic assays for 
determining marker protein and/or nucleic acid 
expression as welt as marker protein activity, in the 
context of a biological sample (e.g., blood, serum, 
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cells, tissue) to thereby determine whether an 
individual is afflicted with a disease or disorder, or 
is at risk of developing a disorder, associated with 
increased or decreased marker protein expression or 
5 activity. The invention also provides for prognostic 
(or predictive) assays for determining whether an 
individual is at risk of developing a disorder 
associated with marker protein, nucleic acid expression 
or activity. For example, the number of copies of a 

10 marker gene can be assayed in a biological sample. 
Such assays can be used for prognostic or predictive 
purposes to thereby phophylactically treat an 
individual prior to the onset of a disorder (e.g., 
multiple sclerosis) characterized by or associated with 

15 marker protein, nucleic acid expression or activity. 

[0200] Another aspect of the invention pertains to 
monitoring the influence of agents {e.g., drugs, 
compounds) on the expression or activity of marker in 
20 clinical trials. 

[0201] These and other agents are described in further 
detail in the following sections. 

25 1. Diagnostic Assays 

[0202] An exemplary method for detecting the presence 
or absence of marker protein or nucleic acid of the 
invention in a biological sample involves obtaining a 
30 biological sample from a test subject and contacting 
the biological sample with a compound or an agent 
capable of detecting the protein or nucleic acid (e.g., 
mRNA, genomic DNA) that encodes the marker protein such 
that the presence of the marker protein or nucleic acid 
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is detected in the biological sample. A preferred 
agent for detecting mRNA or genomic DNA corresponding 
to a marker gene or protein of the invention is a 
labeled nucleic acid probe capable of hybridizing to a 
5 mRNA or genomic DNA of the invention. Suitable probes 
for use in the diagnostic assays of the invention are 
described herein. 

[0203] A preferred agent for detecting marker protein 

10 is an antibody capable of binding to marker protein, 
preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, 
monoclonal. An intact antibody, or a fragment thereof 
{e.g., Fab or F(ab') 2 ) can be used. The term "labeled" , 

15 with regard to the probe or antibody, is intended to 
encompass direct labeling of the probe or antibody by 
coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect 
labeling of the probe or antibody by reactivity with 

20 another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary 
antibody using a f luorescently labeled secondary 
antibody and end- labeling of a DNA probe with biotin 
such that it can be detected with f luorescently labeled 

25 streptavidin. The term "biological sample" is intended 
to include tissues, cells and biological fluids 
isolated from a subject, as well as tissues, cells and 
fluids present within a subject. That is, the 
detection method of the invention can be used to defeat 

30 marker mRNA, protein, or genomic DNA in a biological 
sample in vitro as well as in vivo. For example, in 
vitro techniques for detection of marker mRNA include 
Northern hybridizations and in situ hybridizations. In 
vitro techniques for detection of marker protein 

35 include enzyme linked immunosorbent assays (ELISAs) , 
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Western blots, immunoprecipitations and 

immunofluorescence. In vitro techniques for detection 
of marker genomic DNA include Southern hybridizations. 
Furthermore, in vivo techniques for detection of marker 
5 protein include introducing into a subject a labeled 
ant i -marker antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard 
imaging techniques. 

10 

[0204] In one embodiment, the biological sample 
contains protein molecules from the test subject. 
Alternatively, the biological sample can contain mRNA 
molecules from the test subject or genomic DNA 
15 molecules from the test subject. A preferred 
biological sample is a serum sample isolated by 
conventional means from a subject. 

[02 05] In another embodiment, the methods further 
20 involve obtaining a control biological sample (e.g., 

noninvolved tissue or from a non-diseased subject) from 
a control subject, contacting the control sample with a 
compound or agent capable of detecting marker protein, 
mRNA, or genomic DNA, such that the presence of marker 
2 5 protein, mRNA or genomic DNA is detected in the 

biological sample, and comparing the presence of marker 
protein, mRNA or genomic DNA in the control sample with 
the presence of marker protein, mRNA or genomic DNA in 
the test sample. 

30 

[0206] The invention also encompasses kits for 
detecting the presence of marker in a biological 
sample. For example, the kit can comprise a labeled 
compound or agent capable of detecting marker protein 
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or mRNA in a biological sample; means for determining 
the amount of marker in the sample; and means for 
comparing the amount of marker in the sample with a 
standard. The compound or agent can be packaged in a 
5 suitable container. The kit can further comprise 

instructions for using the kit to detect marker protein 
or nucleic acid. 

2. Prognostic Assays 

10 

[0207] The diagnostic methods, described herein can 
furthermore be utilized to identify subjects having or 
at risk of developing a disease or disorder associated 
with aberrant marker expression or activity. As used 

15 herein, the term "aberrant" includes a marker 

expression or activity which deviates from the wild 
type marker expression or activity. Aberrant 
expression or activity includes increases or decreased 
expression or activity, as well as expression or 

2 0 activity which does not follow the wild type 

developmental pattern of expression or the subcellular 
pattern of expression. For example, aberrant marker 
expression or activity is intended to include the cases 
in which a mutation in the marker gene causes the 

25 marker gene to be under- expressed or over-expressed and 
situations in which such mutations result in a non- 
functional marker protein or a protein which does not 
function in a wild-type fashion, e.g., a protein which 
does not interact with a marker ligand or one which 

30 interacts with a non marker protein ligand. 

[0208] The assays described herein, such as the 
preceding diagnostic assays or the following assays, 
can be utilized to identify a subject having or at risk 
35 of developing a disorder associated with a 
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misregulation in marker protein activity or nucleic 
acid expression, such as multiple sclerosis. 
Alternatively, the prognostic assays can be utilized to 
identify a subject having or at risk for developing a, 
5 disorder associated with a misregulation in marker 
protein activity or nucleic acid expression, such as 
multiple sclerosis. Thus, the present invention 
provides a method for identifying a disease or disorder 
associated with aberrant marker expression or activity 

10 in which a test sample is obtained from a subject and 
marker protein or nucleic acid {e.g., mRNA or genomie 
DNA) is detected, wherein the presence of marker 
protein or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder 

15 associated with aberrant marker expression or activity. 
As used herein, a "test sample" includes a biological 
sample obtained from a subject of interest. For 
example, a test sample can be a biological fluid (e.g., 
blood PBMCs) , cell sample, or tissue {e.g., brain). 

20 

[0209] Furthermore, the prognostic assays described 
herein can be used to determine whether a subject can 
be administered an agent {e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 

25 molecule, or other drug candidate) to treat a disease 
or disorder associated with increased or degreased 
marker expression or activity. For example, such 
methods can be used to determine whether a subject can 
be effectively treated with an agent for a disorder 

30 such as multiple sclerosis. Thus, the present 

invention provides methods for determining whether a 
subject can be effectively treated with an agent for a 
disorder associated with increased or decreased marker 
expression or activity in which a test sample is 

35 obtained and marker protein or nucleic acid expression 
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or activity is detected (e.g., wherein the abundance of 
marker protein or nucleic acid expression or activity 
is diagnostic for a subject that can be administered 
the agent to treat a disorder associated with increased 
5 or decreased marker expression or activity) . 

[0210] The methods of the invention can also be used 
to detect genetic alterations in a marker gene, thereby 
determining if a subject with the altered gene is at 
10 risk for a disorder characterized by misregulation in 
marker protein activity or nucleic acid expression, 
such as multiple sclerosis. In preferred embodiments, 
the methods include detecting, in a sample of cells 
from the subject, the presence or absence of a genetic 
15 alteration characterized by at least one of an 

alteration affecting the integrity of a gene encoding a 
marker-protein, or the mis-expression of the marker 
gene. For example, such genetic alterations can be 
detected by ascertaining the existence of at least one 
20 of 1) a deletion of one or more nucleotides from a 

marker gene; 2) an addition of one or more nucleotides 
to a marker gene; 3) a substitution of one or more 
nucleotides of a marker gene, 4) a chromosomal 
rearrangement of a marker gene; 5) an alteration in the 
25 level of a messenger RNA transcript of a marker gene, 
6) aberrant modification of a ma=ker gene, such as of 
the methylation pattern of the genomic DNA, 7) the 
presence of a non-wild type splicing pattern of a 
messenger RNA transcript of a marker gene, 8) a non- 
30 wild type level of a marker-protein, 9) allelic loss of 
a marker gene, and 10) inappropriate post- translational 
modification of a marker-protein. As described herein, 
there are a large number of assays known in the art 
which can be used for detecting alterations in a marker 
35 gene. A preferred biological sample is a tissue (e.g., 
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brain) or blood sample isolated by conventional means 
from a subject. 

• 

[0211] In certain embodiments, detection of the 
5 alteration involves the use of a probe/primer in a 

polymerase chain reaction (PCR) (see, e.g., U.S. Patent 
Nos. 4,683, (95 and 4,683,202), such as anchor PCR or 
RACE PCR, or, alternatively, in a ligation chain 
reaction (LCR) (see, e.g., Landegran et al. (1988) 

10 Science 241:1077-1080; and Nakazawa et al. (1994) Proc. 
Mail. Acad. Sci. USA 91:360-364), the latter of which 
can be particularly useful for detecting point 
mutations in the marker-gene (see Abravaya et al . 
(1995) Nucleic Acids Res. 23:675-682). This method can 

15 include the steps of collecting a sample of cells from 
a subject, isolating nucleic acid (e.g., genomic, mRNA 
or both) from the cells of the sample, contacting the 
nucleic acid sample with one or more primers which 
specifically hybridize to a marker gene under 

20 conditions such that hybridization and amplification of 
the marker-gene (if present) occurs, and detecting the 
presence or absence of an amplification product, or 
detecting the size of the amplification product and 
comparing the length to a control sample. It is 

25 anticipated that PCR and/or LCR may be desirable to use 
as a preliminary amplification step in conjunction with 
any of the techniques used for detecting mutations 
described herein. 

30 [0212] Alternative amplification methods include: 

self sustained sequence replication (Guatelli, JC. et 
al., (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), 
transcriptional amplification system (Kwoh, D.Y. et 
al., (1989) Proc. Natl. Acad. Sci. USA 8 6:1173-1177), 
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Q-Beta Replicase (Lizardi, P.M. et al. (1988) Bio- 
Technology 6:1197) , or any other nucleic acid 
amplification method, followed by the detection of the 
amplified molecules using techniques well known to 
5 those of skill in the art. These detection schemes are 
especially useful for the detection of nucleic acid 
molecules if such molecules are present in very low 
numbers . 

10 [0213] In an alternative embodiment, mutations in a 
marker gene from a sample cell can be identified by 
alterations in restriction enzyme cleavage patterns. 
For example, sample and control DNA is isolated, 
amplified (optionally) , digested with one or more 

15 restriction endonucleases , and fragment length sizes 
are determined by gel electrophoresis and compared. 
Differences in fragment length sizes between sample and 
control DNA indicates mutations in the sample DNA. 
Moreover, the use of sequence specific ribozymes (see, 

2 0 for example, U.S. Patent No. 5,4 98,531) can be used to 
score for the presence of specific mutations by 
development or loss of a ribozyme cleavage site. 

[0214] In other embodiments, genetic mutations in a 
25 marker gene or a gene encoding a marker protein of the 
invention can be identified by hybridizing a sample and 
control nucleic acids, e.g., DNA or RNA, to high 
density arrays containing hundreds or thousands of 
oligonucleotides probes (Cronin, M.T. et al . (1996) 
30 Human Mutation 7: 244-255; Kozal, M.J. et al . (1996) 
Mature Medicine 2: 753-759). For example, genetic 
mutations in marker can be identified in two 
dimensional arrays containing light generated DNA 
probes as described in Cronin, M.T. et al. supra. 
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Briefly, a first hybridization array of probes can be 
used to scan through long stretches of DNA in a sample 
and control to identify base changes between the 
sequences by making linear arrays of sequential 
5 overlapping probes. This step allows the 

identification of point mutations. This step is 
followed by a second hybridization array that allows 
the characterization of specific mutations by using 
smaller, specialized probe arrays complementary to all 
10 variants or mutations detected. Each mutation array is 
composed of parallel probe sets, one complementary to 
the wild- type gene and the other complementary to the 
mutant gene. 

15 [0215] In yet another embodiment, any of a variety of 
sequencing reactions known in the art can be used to 
directly sequence the marker gene and detect mutations 
by comparing the sequence of the sample marker with the 
corresponding wild-type (control) sequence. Examples 

20 of sequencing reactions include those based on 

techniques developed by Maxam and Gilbert ( (1977) Proc. 
Natl. Acad. Scl . USA 74:560) or Sanger ((1977) Proc. 
Natl. Acad. Sci . USA 74:5463). It is also contemplated 
that any of a variety of automated sequencing 

25 procedures can be utilized when performing the 
diagnostic assays ((1995) Biotechniques 19:448), 
including sequencing by mass spectrometry (see, e.g., 
PCT International Publication No. WO 94116101; Cohen et 
al. (1996) Adv. Chromatogr, 36:127-162; and Griffin et 

30 al. (1993) Appl. Biochem. Biotechnol . 38:147-159). 



[0216] Other methods for detecting mutations in the 
marker gene or gene encoding a marker protein of the 
invention include methods in which protection from 
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cleavage agents is used to detect mismatched bases in 
RNA/RNA or RNA/DNA heteroduplexes (Myers et al. (1985) 
Science 230:1242). In general, the art technique of 
"mismatch cleavage" starts by providing heteroduplexes 
5 by hybridizing (labeled) RNA or DNA containing the 
wild-type marker sequence with potentially mutant RNA 
or DNA obtained from a tissue sample. The double- 
stranded duplexes are treated with an agent which 
cleaves single-stranded regions of the duplex such as 

10 which will exist due to basepair mismatches between the 
control and sample strands. For instance, RNA/ DNA 
duplexes can be treated with RNase and DNA/ DNA hybrids 
treated with SI nuclease to enzymatically digest the 
mismatched regions. In other embodiments, either 

15 DNA/DNA or RNA/DNA duplexes can be treated with 

hydroxylamine or osmium tetroxide and with piperidine 
in order to digest mismatched regions. After digestion 
of the mismatched regions, the resulting material is 
then separated by size on denaturing polyacrylamide 

20 gels to determine the site of mutation. See, for 

example, Cotton et al. (1988) Proc. Natl Acad Sex USA 
85:4397; Saleeba et al . (1992) Methods Enzymol . 
517:286-295. In a preferred embodiment, the control 
DNA or RNA can be labeled for detection. 

25 

[02171 In still another embodiment, the mismatch 
cleavage reaction employs one or more proteins that 
recognize mismatched base pairs in double-stranded DNA 

(so called "DNA mismatch repair" enzymes) in defined 
30 systems for detecting and mapping point mutations in 
marker cDNAs obtained from samples of cells. For 
example, the mutY enzyme of E. coli cleaves A at G/A 
mismatches and the thymidine DNA glycosylase from HeLa 
cells cleaves T at G/T mismatches (Hsu et al . (1994) 
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Carcinogenesis 15:1657-1652). According to an 
exemplary embodiment, a probe based on a marker 
sequence, e.g., a wild-type marker sequence, is 
hybridized to a cDNA or other DNA product from a test 
5 cell(s). The duplex is treated with a DNA mismatch 
repair enzyme, and the cleavage products, if any, can 
be detected from electrophoresis protocols or the like. 
See, for example, U.S. Patent No. 5,459,039. 

10 [0218] In other embodiments, alterations in 

electrophoretic mobility will be used to identify 
mutations in marker genes or genes encoding a marker 
protein of the invention. For example, single strand 
conformation polymorphism (SSCP) may be used to detect 

15 differences in electrophoretic mobility between mutant 
and wild type nucleic acids (Orita et al . (1989) Proc 
Natl. Acad. Sci USA: 86:2766, see also Cotton (1993) 
Mutat. Res. 285:125-144; and Hayashi (1992) Genet. 
Anal. Tech Appl. 9:73-79). Single-stranded DNA 

20 fragments of sample and control marker nucleic acids 
will be denatured and allowed to renature. The 
secondary structure of single-stranded nucleic acids 
varies according to sequence, the resulting alteration 
in electrophoretic mobility enables the detection of 

25 even a single base change. The DNA fragments may be 
labeled or detected with labeled probes. The 
sensitivity of the assay may be enhanced by using RNA 
(rather than DNA) , in which the secondary structure is 
more sensitive to a change in sequence. In a preferred 

30 embodiment, the subject method utilizes heteroduplex 
analysis to separate double stranded heteroduplex 
molecules on the basis of changes in elecrtophoretic 
mobility (Keen et al. (1991) Trends Genet 7:5). 
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[0219] In yet another embodiment the movement of 
mutant or wild- type fragments in polyacrylamide gels 
containing a gradient of denaturant is assayed using 
denaturing gradient gel electrophoresis (DGGE) (Myers 
5 et al. (1985) Nature 313:495). When DGGE is used as 
the method of analysis, DNA will be modified to insure 
that it does not completely denature, for example by 
adding a GC clamp of approximately 4 0 by of high- 
melting GC-rich DNA by PCR. In a further embodiment, a 
10 temperature gradient is used in place of a denaturing 
gradient to identify differences in the mobility of 
control and sample DNA (Rosenbaum and Reissner (1987) 
Biophys Chem 265:12753). 

15 [0220] Examples of other techniques for detecting 
point mutations include, but are not limited to, 
selective oligonucleotide hybridization, selective 
amplification, or selective primer extension. For 
example, oligonucleotide primers may be prepared in 

20 which the known mutation is placed centrally and then 
hybridized to target DNA under conditions which permit 
hybridization only if a perfect match is found (Saiki 
et al. (1986) Nature 324:163); Saiki et al. (1989) 
Proc. Natl. Acad. Sci USA 86:6230). Such allele 

25 specific oligonucleotides are hybridized to PCR 

amplified target DNA or a number of different mutations 
when the oligonucleotides are attached to the 
hybridizing membrane and hybridized with labeled target 
DNA. 

30 

[0221] Alternatively, allele specific amplification 
technology which depends on selective PCR amplification 
may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
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amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends 
on differential hybridization) (Gibbs et al. (1989) 
Nucleic Acids Res. 17:2437-2448) or at the extreme 3» 
5 end of one primer where, under appropriate conditions, 
mismatch can prevent, or reduce polymerase extension 
(Prossner (1993) Tibtech 11:238) . In addition it may 
be desirable to introduce a novel restriction site in 
the region of the mutation to create cleavage-based 

10 detection (Gasparini et al. (19 92) Mol. Cell Probes 
6:1). It is anticipated that in certain embodiments 
amplification may also be performed using Taq ligase 
for amplification (Barany (1991) Proc. Natl. Acad. Sci 
USA 88:189). In such cases, ligation will occur only 

15 if there is a perfect match at the 3 1 end of the 5 ■ 

sequence making it possible to detect the presence of a 
known mutation at a specific site by looking for the 
presence or absence of amplification. 

20 [0222] The methods described herein may be performed, 
for example, by utilizing prepackaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently 
used, e.g., in clinical settings to diagnose subjects 

25 exhibiting symptoms or family history of a disease or 
illness involving a marker gene. 

[0223] Furthermore, any cell type or tissue in which 
marker is expressed may be utilized in the prognostic 
30 assays described herein. 
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3. Monitoring of Effects During Clinical Trials 

[0224] Monitoring the influence of agents (e.g., 
drugs) on the expression or activity of a marker 
5 protein (e.g., the modulation of multiple sclerosis) 
can be applied not only in basic drug screening, but 
also in clinical trials. For example, the 
effectiveness of an agent determined by a screening 
assay as described herein to increase marker gene 

10 expression, protein levels, or upregulate marker 
activity, can be monitored in clinical trials of 
subjects exhibiting decreased marker gene expression, 
protein levels, or downruegulated marker activity. 
Alternatively, the effectiveness of an agent determined 

15 by a screening assay to decrease marker gene 

expression, protein levels, or downregulate marker 
activity, can be monitored in clinical trials of 
subjects exhibiting increased marker gene expression, 
protein levels, or upregulated marker activity. In 

20 such clinical trials, the expression or activity of a 
marker gene, and preferably, other genes that have been 
implicated in, for example, a marker-associated 
disorder (e.g., multiple sclerosis) can be used as a 
"read out" or markers of the phenotype of a particular 

25 cell. 

[0225] For example, and not by way of limitation, 
genes, including marker genes and genes encoding a 
marker protein of the invention, that are modulated in 
30 ' cells by treatment with an agent (e.g., compound, drug 
or small molecule) which modulates marker activity 
(e.g., identified in a screening assay as described 
herein) can be identified. Thus, to study the effect 
of agents on marker-associated disorders (e.g., 



WO 02/079218 



PCT/US02/09305 



multiple sclerosis), for example, in a clinical trial, 
cells can be isolated and RNA prepared and analyzed for 
the levels of expression of marker and other genes 
implicated in the marker-associated disorder, 
5 respectively. The levels of gene expression {e.g., a 
gene expression pattern) can be quantified by northern 
blot analysis or RT-PCR, as described herein, or 
alternatively by measuring the amount of protein 
produced, by one of the methods as described herein, or 

10 by measuring the levels of activity of marker or other 
genes. In this way, the gene expression pattern can 
serve as a marker, indicative of the physiological 
response of the cells to the agent. Accordingly, this 
response state may be determined before, and at various 

15 points during treatment of the individual with the 
agent . 

[022 6] In a preferred embodiment, the present 
-invention provides a method for monitoring the 

20 effectiveness of treatment of a subject with an agent 

(e.g., an agonist, antagonist, peptidomimetic , protein, 
peptide, nucleic acid, small molecule, or other drug 
candidate identified by the screening assays described 
herein) including the steps of (i) obtaining a pre- 

25 administration sample from a subject prior to 

administration of the agent; (ii) detecting the level 
of expression of a marker protein, mRNA, or genomic DNA 
in the pre-administration sample; (iii) obtaining one 
or more post -administration samples from the subject; 

30 (iv) detecting the level of expression or activity of 
the marker protein, mRNA, or genomic DNA in the post- 
administration samples; (v) comparing the level of 
expression or activity of the marker protein, mRNA, or 
genomic DNA in the pre-administration sample with the 

35 marker protein, mRNA, or genomic DNA in the post 
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administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. 
For example, increased administration of the agent may 
be desirable to increase the expression or activity of 
5 marker to higher levels than detected, i.e., to 
increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent 
may be desirable to decrease expression or activity of 
marker to lower levels than detected, i.e. to decrease 
10 the effectiveness of the agent. According to such an 
embodiment, marker expression or activity may be used 
as an indicator of the effectiveness of an agent, even 
in the absence of an observable phenotypic response. 

15 C. Methods of Treatment 

[0227] The present invention provides for both 
prophylactic and therapeutic methods of treating a 
subject at risk for (or susceptible to) a disorder or 

2 0 having a disorder associated with aberrant marker 
expression or activity. With regards to both 
prophylactic and therapeutic methods of treatment, such 
treatments may be specifically tailored or modified, 
based on knowledge obtained from the field of 

25 pharmacogenomics . "Pharmacogenomics", as used herein, 
includes the application of genomics technologies such 
as gene sequencing, statistical genetics, and gene 
expression analysis to drugs in clinical development 
and on the market. More specifically, the term refers 

30 the study of how a subject's genes determine his or her 
response to a drug (e.g., a subject's "drug response 
phenotype", or "drug response genotype".) Thus, 
another aspect of the invention provides methods for 
tailoring an individual's prophylactic or therapeutic 

35 treatment with either the marker molecules of the 
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present invention or marker modulators according to 
that individual's drug response genotype. 
Pharmacogenomics allows a clinician or physician to 
target prophylactic or therapeutic treatments to 
5 subjects who will most benefit from the treatment and 
to avoid treatment of subjects who will experience 
toxic drug-related side effects. 

1. Prophylactic Methods 

10 

[022 8] In one aspect, the invention provides a method 
for preventing in a subject, a disease or condition 
{e.g., multiple sclerosis) associated with increased or 
decreased marker expression or activity, by 

15 administering to the subject a marker protein or an 
agent which modulates marker protein expression or at 
least one marker protein activity. Subjects at risk 
for a disease which is caused or contributed to by 
increased or decreased marker expression or activity 

2 0 can be identified by, for example, any or a combination 
of diagnostic or prognostic assays as described herein. 
Administration of a prophylactic agent can occur prior 
to the manifestation of symptoms characteristic of the 
differential marker protein expression, such that a 

25 disease or disorder is prevented or, alternatively, 

delayed in its progression. Depending on the type of 
marker aberrancy (e.g., increase or decrease in 
expression level), for example, a marker protein, 
marker protein agonist or marker protein antagonist 

30 agent can be used for treating the subject. The 

appropriate agent can be determined based on screening 
assays described herein. 



WO 02/079218 



PCT/US02/09305 



- 120 - 
2. Therapeutic Methods 

[0229] Another aspect of the invention pertains to 
methods of modulating marker protein expression or 
5 activity for therapeutic purposes. Accordingly, in an 
exemplary embodiment, the modulatory method of the 
invention involves contacting a cell with a marker 
protein or agent that modulates one or more of the 
activities of a marker protein activity associated with 

10 the cell. An agent that modulates marker protein 

activity can be an agent as described herein, such as a 
nucleic acid or a protein, a naturally-occurring target 
molecule of a marker protein (e.g., a marker protein 
substrate) , a marker protein antibody, a marker protein 

15 agonist or antagonist, a peptidomimetic of a marker 

protein agonist or antagonist, or other small molecule. 
In one embodiment, the agent stimulates one or more 
marker protein activities. Examples of such 
stimulatory agents include active marker protein and a 

2 0 nucleic acid molecule encoding marker protein that has 
been introduced into the cell. In another embodiment, 
the agent inhibits one or more marker protein 
activities. Examples of such inhibitory agents include 
antisense marker protein nucleic said molecules, anti- 

25 marker protein antibodies, and marker protein 

inhibitors. These modulatory methods can be performed 
in vitro (e.g., by culturing the cell with the agent) 
or, alternatively, in vivo (e.g., by administering the 
agent to a subject) . As such, the present invention 

30 provides methods of treating an individual afflicted 
with a disease or disorder characterized by aberrant 
expression or activity of a marker protein or nucleic 
acid molecule. In one embodiment, the method involves 
administering an agent (e.g., an agent identified by a 

35 screening assay described herein) , or combination of 
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agents that modulates {e.g., upregulates or 
downregulates) marker protein expression or activity. 
In another embodiment, the method involves 
administering a marker protein or nucleic acid molecule 
5 as therapy to compensate for reduced or aberrant marker 
protein expression or activity. 

[0230] Stimulation of marker protein activity is 
desirable in situations in which marker protein is 

10 abnormally downregulated and/or in which increased 

marker protein activity is likely to have a beneficial 
effect. For example, stimulation of marker protein 
activity is desirable in situations in which a marker 
is downregulated and/or in which increased marker 

15 protein activity is likely to have a beneficial erect. 
Likewise, inhibition of marker protein activity is 
desirable in situations in which marker protein is 
abnormally upregulated and/or in which decreased marker 
protein activity is likely to have a beneficial effect. 

20 

3 . Pharmacogenomics 

[0231] The marker protein and nucleic acid molecules 
of the present invention, as well as agents, inhibitors 

25 or modulators which have a stimulatory or inhibitory 
effect on marker protein activity {e.g., marker gene 
expression) as * identified by a screening assay 
described herein can be administered to individuals to 
treat (prophylactically or therapeutically) marker- 

30 associated disorders (e.gr., multiple sclerosis) 

associated with aberrant marker protein activity. In 
conjunction with such treatment, pharmacogenomics 
(i.e., the study of the relationship between an 
individuals genotype and that individual's response to 
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a foreign compound or drug) may be considered. 
Differences in metabolism of therapeutics can lead to 
severe toxicity or therapeutic failure by altering the 
relation between dose and blood concentration of the 
5 pharmacologically active drug. Thus, a physician or 
clinician may consider applying knowledge obtained in 
relevant pharmacogenomics studies in determining 
whether to administer a marker molecule or marker 
modulator as well as tailoring the dosage and/or 
10 therapeutic regimen of treatment with a marker molecule 
or marker modulator. 

[0232] Pharmacogenomics deals with clinically 
significant hereditary variations in the response to 

15 drugs due to altered drug disposition and abnormal 
action in affected persons. See, for example, 
Eichelbaum, M. et al . (1996) Clin. Exp. Pharmacol. 
Physiol. 23(10-11) :983-985 and Linden, M.W. et al . 
(1997) Clin. Chem. 43 (2) : 254 -266 . In general, two 

20 types of pharmacogenetic conditions can be 

differentiated. Genetic conditions transmitted as a 
single factor altering the way drugs act on the body 
(altered drug action) or genetic conditions transmitted 
as single factors altering the way the body acts on 

25 drugs (altered drug metabolism) . These pharmacogenetic 
conditions can occur either as rare genetic defects or 
as naturally-occurring polymorphisms. For example, 
glucose- 6 -phosphate dehydrogenase deficiency (G6PD) is 
a common inherited enzymopathy in which the main 

30 clinical complication is haemolysis after' ingestion of 
oxidant drugs (anti -malarials , sulfonamides, 
analgesics, nitrofurans) and consumption of fava beans. 

[0233] One pharmacogenomics approach to identifying 
35 genes that predict drug response, known as n a genome- 



WO 02/079218 



PCT/US02/09305 



- 123 - 

wide association", relies primarily on a high- 
resolution map of the human genome consisting of 
already known gene-related markers {e.g., a "bi- 
allelic" gene marker map which consists of 60,000- 
5 100,000 polymorphic or variable sites on the human 

genome, each of which has two variants.) Such a high- 
resolution genetic map can be compared to a map of the 
genome of each of a statistically substantial number of 
subjects taking part in a Phase II/III drug trial to 

10 identify markers associated with a particular observed 
drug response or side effect. Alternatively, such a 
high resolution map can be generated from a combination 
of some ten-million known single nucleotide 
polymorphisms (SNPs) in the human genome. As used 

15 herein, a "SNP" is a common alteration that occurs in a 
single nucleotide base in a stretch of DNA. For 
example, a SNP may occur once per every 1000 bases of 
DNA. A SNP may be involved in a disease process, 
however, the vast majority may not be disease 

20 associated. Given a genetic map based on the 

occurrence of such SNPs, individuals can be grouped 
into genetic categories depending on a particular 
pattern of SNPs in their individual genome. In such a 
manner, treatment regimens can be tailored to groups of 

25 genetically similar individuals, taking into account 
traits that may be common among such genetically 
similar individuals . 

[0234] Alternatively, a method termed the "candidate 
30 gene approach' 1 , can be utilized to identify genes that 
predict drug response. According to this method, if a 
gene that encodes a drugs target is known (e.g\, a 
marker protein of the present invention) , all common 
variants of that gene can be fairly easily identified 
35 in the population and it can be determined if having 
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one version of the gene versus another is associated 
with a particular drug response. 

[0235] As an illustrative embodiment, the activity of 
5 drug metabolizing enzymes is a major determinant of 
both the intensity and duration of drug action. The 
discovery of genetic polymorphisms of drug metabolizing 
enzymes (e.g., N-acetyltransf erase 2 (NAT 2) and 
cytochrome P450 enzymes CYP2 D6 and CYPZC19) has 
10 provided an explanation as to why some subjects do not 
obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the 
standard and safe dose of a drug. These polymorphisms 
are expressed in two phenotypes in the population, the 
15 extensive metabolizer (EM) and poor metabolizer (PM) . 
The prevalence of PM is different among different 
populations . For example, the gene coding for CYP2D6 
is highly polymorphic and several mutations have been 
identified in PM, which all lead to the absence of 
20 functional CYP2D6 . Poor metabolizers of CYP2D6 and 
CYP2C19 quite frequently experience exaggerated drug 
response and side effects when they receive standard 
doses. If a metabolite is the active therapeutic 
moiety, PM show no therapeutic response, as 
25 demonstrated for the analgesic effect of codeine 

mediated by its CYP2D6-f ormed metabolite morphine. The 
other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. 
Recently, the molecular basis of ultra-rapid metabolism 
30 has been identified to be due to CYP2D6 gene 
amplification. 

[0236] Alternatively, a method termed the "gene 
expression profiling", can be utilized to identify 
3 5 genes that predict drug response. For example, the 
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gene expression of an animal dosed with a drug (e.g., a 
marker molecule or marker modulator of the present 
invention) can give an indication whether gene pathways 
related to toxicity have been turned on. 

5 

[0237] Information generated from more than one of the 
above pharmacogenomics approaches can be used to 
determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment an individual. 

10 This knowledge, when applied to dosing or drug 

selection, can avoid adverse reactions or therapeutic 
failure and thus enhance therapeutic or prophylactic 
efficiency when treating a subject with a marker 
molecule or marker modulator, such as a modulator 

15 identified by one of the exemplary screening assays 
described herein. 

[023 8] This invention is further illustrated by the 
following examples which should not be construed as 
20 limiting. The contents of all references, patents and 
published patent applications cited throughout this 
application, as well as the Figures and Tables are 
incorporated herein by reference. 
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EXAMPLES 

EXAMPLE X: IDENTIFICATION AND CHARACTER I Z AT I ON OF 
MARKER cDNA IN MURINE MODEL OF EXPERIMENTAL AUTOIMMUNE 
5 ENCHE PHAL I T I S (EAE) 

A. Induction of EAE in Mice 

[023 9] 12 SJL mice were immunized subcutaneously with 
10 150 jxg injection of proteolipid protein (PLP) , peptide 
residues 139-151 in complete Freud's adjuvant, a 
protein capable of inducing EAE in mice. After 10 
days, the spleens were harvested, and splenocyte cells 
were isolated and reactivated in vitro for 4 days with 
15 5 ,ug of PLP. The amplified splenocyte cells were then 
injected into SJL mice at 15 x 10 6 cells per mouse. 

B. In Vivo Development of EAE and Isolation of Immune 
Cells and Microglial Cells 

20 

[0240] Development of EAE in mice was characterized by 
three stages: onset, peak and recovery. Onset of EAE 
in mice was characterized by a limp tail 7-8 days after 
injection. Peak development of EAE in mice appeared as 

25 full hind limb paralysis approximately 10-12 days after 
injection, while recovery appeared as hind limb 
weakness approximately 16-18 days after injection. At 
each of the above 3 stages, the following numbers of 
mice were sacrificed and harvested for cervical spinal 

30 cord and mid-brain: 2 mice at onset, 6 at peak and 4 
at recovery. To isolate infiltrating immune cells and 
microglial cells, the samples were run over a percoll 
gradient to remove neurons, endothelial cells and 
astrocytes . 
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C. Isolation of RNA 

[0241] Total RNA was isolated using the RNeasy mini 
kit (Quiagen, Hilden, Germany) . To prepare cRNA for 
5 hybridization, 5 pig of total RNA was denatured at 70°C 
with T7-tagged oligo-dT primer, cooled on ice, then 
reverse transcribed with 200 units Superscript RT II at 
50°C for 1 hour in lx first strand buffer, 10 mM DTT 
and 0.5 mM of each dNTP (Gibco BRL, Gaiethersburg, MD) „ 

10 Second strand cDNA was synthesized by adding 40 units 
DNA pol I, 10 units E. coli DNA ligase, 2 units Rnase 
H, 30 /iL second strand buffer, 3 //I 10 mM each dNTP, 
and water to 150 y£L> final volume and incubating at 
15.8°C for 2 hours. The resulting cDNA was extracted 

15 once with phenol/chlorof orm/isoamylalcohol . CDNA was 
separated on a Phase Lock Gel tube at maximum speed for 

2 min and precipitated with sodium acetate and 100 
ethanol . The resulting pellet was washed with 80% 
ethanol, was dried and was resuspended in 

2 0 diethyl pyrocarbonate - treated (DEPC- treated) water . 

[0242) Labeled RNA was prepared from clones containing 
a T7 RNA polymerase promoter site by incorporating 
labeled ribonucleotides in an in vitro transcription 

25 (IVT) reaction. Half of the purified cDNA was used for 
in vitro transcription with a T7 RNA polymerase kit, 
following manufacturer instructions and using an 
overnight 37 °C incubation, thereby incorporating 
biotinylated CTP and UTP. Labeled RNA was purified 

30 using RNeasy columns (Quiagen) . RNA was concentrated 
and then quant itated by spectrophotometry. Labeled RNA 
(13-15 ^g) was fragmented in 40 mM Tris-acetate 8.0, 
100 mM potassium acetate, 30 mM magnesium acetate for 

3 5 min at 94 C in a total volume of 4 0 ^L. 



WO 02/079218 



PCT/US02/09305 



- 128 - 

D. Array Hybridization and Detection of Fluorescence 

[0243] The labeled and fragmented RNA probes were 
diluted in 1 x MES buffer, BI0948, Bio C, B ere, 100 
5 ^ig/ml herring sperm DNA, and 50 //g/ml acetylated BSA. 
New probes were pre-hybridized in a microfuge tube with 
glass beads at 45°C overnight to remove debris. 
Oligonucleotide arrays composed of approximately 11,000 
murine genes (Microarray, Affymetrix, Cat Nos. SubA 

10 #510243, SubB #510244) were pre-hybridized with 1 x MES 
hybridization buffer at 4 5°C for 5 min and then 
insoluble material was removed by centrif ugation . Pre- 
hybridization buffer was removed from oligo array 
cartridges, 2 00 juL probe added and cartridges were 

15 hybridized for 16 hours at 45°C at 60 rpm. After 

hybridization, probes were removed and the cartridges 
washed extensively with 6 x SSPET using a f luidics 
station (Affymetrix) . Following hybridization, the 
solutions were removed, the arrays were washed with 6x 

20 SSPE-T at 22°C for 7 min, and then washed with 0.5 x 

SSPE-T at 40°C for 15 minutes. When biot in-labeled RNA 
was used, the hybridized RNA was stained with a 
streptavidin-phycoerythrin conjugate (Molecule Probes, 
Eugene, OR) prior to reading. Hybridized arrays were 

25 stained with 2 jug/ml streptavidin-phycoerythrin in 6x 
SSPE-T at 40°C for 5 minutes. The arrays were read 
using a scanning confocal microscope made for 
Affymetrix by Molecular Dynamics (commercially 
available through Affymetrix, Santa Clara, CA) . The 

30 scanner uses an argon ion laser as the excitation 

source, with the emission detected by a photomultiplier 
tube through either a 530 nm bandpass filter 
(fluorescein), or a 560 nm longpass filter 
(phycoerythrin) . Nucleic acids of either sense or 

35 antisense orientations were used in hybridization 
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experiments. Arrays with probes for either orientation 
(reverse complements of each other) are made using the 
same set of photolithographic masks by reversing the 
order of the photochemical steps and incorporating the 
5 complementary nucleotide. 

E. Quantitative Analysis of Hybridization Patterns and 
Insensitivities 

10 [0244] Following a quantitative scan of an array, or 
biochip, a grid is aligned to the image using the known 
dimensions of the array and the corner control regions 
as markers. The image is reduced to a simple text file 
containing position and intensity information using 

15 software developed at Affymetrix (GENECHIP 3.0 

software) . This information is merged with another 
text file that contains information relating physical 
position on the array to probe sequence and the 
identity of the RNA and the specific part of the RNA 

20 for which the oligonucleotide probe is designed. The 
quantitative analysis of the hybridization results 
involves a simple form of pattern recognition based on 
the assumption that, in the presence of a specific RNA, 
the PM probes will hybridize more strongly on average 

2 5 than their MM partners. The number of instances in 

which the PM hybridization signal is larger than the MM 
signal is computed along with the average of the 
logarithm of the PM/MM ratios for each probe set. 
These values are used to make a decision (using a 

3 0 predefined decision matrix) concerning the presence or 

absence of an RNA. To determine the quantitative RNA 
abundance, the average of the differences (PM minus MM) 
for each probe family is calculated. The advantage of 
the difference method is that signals from random 
35 cross-hybridization contribute equally, on average, to 
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the PM and MM probes, while specific hybridization 
contributes more to the PM probes. By averaging the 
pairwise differences, the real signals add 
constructively while the contributions from cross- 
5 hybridization tend to cancel. When assessing the 
differences between two different RNA samples, the 
hybridization signals from side-by-side experiments on 
identically synthesized arrays are compared directly. 
The magnitude of the changes in the average of the 

10 difference (PM-MM) values is interpreted by comparison 
with the results of spiking experiments as well as the 
signals observed for the internal standard bacterial 
and phage RNAs spiked into each sample at a known 
amount. Data analysis programs developed at 

15 Affymetrix, such as the GENECHIP 3.0 software, perform 
these operation automatically. 

[0245] Over 5000 genes were expressed in the central 
nervous system of the murine samples. Nonetheless, 

2 0 distinct gene expression patterns emerged between peak 
disease and onset or recovery of the disease. In order 
to identify the most active genes in EAE development, 
g enes were sought which revealed a pattern of increased 
or decreased regulation at peak, as compared to onset 

25 or recovery. The onset stage served as the baseline 
comparison, with the assumption that the onset stage 
was similar in gene expression to undifferentiated 
naive T cells (i.e., similar to normal control samples 
samples) . The genes demonstrating at least a two-fold 

30 increase at peak in EAE tissue as opposed to non- 
involved tissue are set forth in Table 1, whereas genes 
having a decrease in expression in EAE tissue as 
opposed to non- involved tissue are set forth in Table 
2. There are several genes which were previously known 

35 to be associated with EAE (e.g., which are differently 
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expressed in MS tissue): TNF-oc and IFN-p. The up- 
regulation of these genes served as validation to the 
method. 

5 [0246] To generate the data in Tables 1 and 2, genes 
were clustered hierarchically into groups on the basis 
of similarity of their expression profiles by the 
procedure of Eisen et al . {(1998) Proc. Nat'l Acad. 
Sci. USA 95:25: 14863-8). Genes that were designated 

10 absent (A) in all samples in a given experiment were 
eliminated from the analysis, as were -fold changes 
over the designated baseline of less than 2 . Genes 
that were present (P) and were of unclear expression 
(M) were also indicated. Average fold changes between 

15 peak/onset and peak/recovery stages were also 
calculated. 

F. Identification of Immune Mediated Genes by 
20 Comparison to In Vitro Expression Patterns 

[0247] In addition, to identify disease-related, 
antigen-driven immune mediated genes, additional mice 
were immunized with an EAE- inducing factor and tissue 

25 harvested for splenocytes. 3 SJL mice were immunized 
on Day 0 with 400 ^9 pertussis toxin and 200 //g 
injection of myelin oligodendrocyte glycoprotein (MOG) 
peptide residue 35-55 which, like PLP, induces EAE in 
mice. On Day 3, an additional dose of 400 jug pertussis 

30 toxin was administered. After 10 days, the spleens 

were harvested, and splenocyte cells were isolated and 
cultured in vitro with 5 A^g/ml MOG for 6 and 2 4 hours, 
to monitor expression of an immune response at an 
earlier time (6 hours) and at a later time (24 hours) . 

35 In addition, a subset of the splenocyte cells were 
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further treated with 5 M9/ntl of anti-IL12p40 (C17.8) 
antibody, which is believed to produce inhibitory- 
effects on the expression of MS (IL12p40 is an 
interleukin that induces iFN-y expression in the Th-1 
5 pathway) . 

[0248] Isolation of mRNA was performed as described 
above, with the in vitro samples being compared to 
murine 11,000 genes {Microarray, Affymetrix, Cat Nos. 

10 SubA #510243, SubB #510244). As shown in Table 3, MOG 
restimulation induced differential expression of 199 
genes. Of these genes, Tables 4 and 5 indicate 17 
genes which further responded to treatment with anti- 
IL12p40. In addition, quantitative analysis was 

15 performed by comparison of the in vitro samples to the 
in vivo studies, to identify common genes which were 
increased or decreased substantially, and/or which were 
affected by addition by MOG + antiILl2p40. Of the 1265 
genes which were found to increase or decrease greater 

20 than three fold following MOG restimulation, 184 were 
found to be in common with genes expressed at peak 
disease of EAE. Further analysis revealed 6 genes in 
particular which also responded to treatment with anti- 
IL12p4 0 (shown in Table 5) : CAPN12 , MT1 , MYOIF, TLN, 

25 UNK_AA117532 and UNK_AA645990 . 

EXAMPLE 2: IDENTIFICATION AND CHARACTERIZATION OF 
MARKER CDNA IN HUMAN MODEL OF MULTIPLE SCLEROSIS. 

30* A. Study Design and Subject Entry Criteria 

[0249] 60 patients diagnosed with varying stages of 
multiple sclerosis participated in the study from three 
sites: University Rochester (Rochester, NY), Lehigh 
35 Valley Hospital (Allentown, PA) and Institute of 
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Neurology, University of London (London, UK) . The 
University of Rochester and Lehigh Valley Hospital 
provided peripheral blood mononuclear cell (PBMC) 
samples, while the Institute of Neurology provided 
5 brain lesion and non-lesional samples. 

[0250] As is characteristic of MS, the patients 
manifested other neurological disorders and a wide 
panoply of symptoms, and were at different stages of 
10 the disease: relapsing-remitting, 2° progressive, 1° 
progressive and acute exacerbation. The samples 
included post-mortem lesional and nonlesional brain 
samples, as well as other normal controls. 

[0251] In general, the donor profile for PBMC samples 
had the following requirements: no current use of 
steroids (washout of 3 0 days) , no current use of 
interferon or Copaxone (washout 90 days) , no previous 
history of cyclophosphamide use, no active viral 
illnesses or infections and no current use of any other 
study drug. 

B. Identification of MS markers from Blood or Brain 

25 [0252] PBMC were isolated from blood samples using 
Lymphoprep (Nycomed, Oslo, Norway) . Lesion and 
nonlesion brain samples were removed from brain stems 
and washed To isolate infiltrating immune cells and 
microglial cells, the samples were run over percoll 

30 gradient to remove neurons, endothelial cells and 
astrocytes . 

[0253] As was performed in Experiment 1(C) above, RNA 
was isolated from the PBMC and brain samples using the 



WO 02/079218 



PCT/US02/09305 



- 134 - 

RNeasy mini kit (Quiagen) , the RNA probes were 
hybridized against human chip arrays having 
approximately 14,000 genes (MicroArray, Affymetrix, cat 
no. 510448) and quantitative analysis was performed 
5 using GeneChip. PBMC samples from MS patients were 
compared to PBMCs from undiseased pateints, resulting 
in over 3 00 differentially regulated genes by more than 
2 fold (p<.01), as shown in Table 7. Lesions from the 
MS brain were compared to nonlesion samples, resulting 
10 in over 100 genes, as shown in Table 8, that increased 
or decreased more than 2 fold (p<.01). A comparison of 
the genes expressed in brain and PBMC samples indicated 
that only 181 genes were shown to be present in both. 

15 C. Comparison to In Vitro Expression Patterns 

[0254] These genes were compared to genes identified 
in the murine samples of Example 1(F). Of the 181 
genes which were either up- or down-regulated in the 
2 0 human PBMC and brain samples, 6 genes were shown to be 
in common with the murine genes that were 
differentially regulated in vitroi EEF1D, PIM2, PRDX2, 
SEC24C, UNK_AJ24 AND XIP (these genes and their 
accession numbers are listed in Table 9) . 

25 

D. Taqman Polymerase Chain Reaction 

[0255] To ensure that the data obtained from the 
GeneChip analysis (described above) was reflective of 

30 the actual level of gene expression in the cell 

samples, the expression of 4 selected cytokines, IL-10, 
IL-8, IL-12p35 and IL-lp, were also measured by 
polymerase chain reaction, and the results compared to 
the expression levels of the same cDNAs of the 

35 cytokines in non- involved samples and in GeneChip 
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analysis. Total RNA was treated with 10 units RQ1 
Dnase I (Promega, Madison, WI) for 30 min at 37°C. 
Samples were extracted with phenol/chloroform, and RNA 
was precipitated with 0.3 M sodium acetate and 2 
5 volumes of 100% ethanol . RNA was resuspended in DEPC- 
treated water, and the RNA concentration determined by 
measuring the optical absorbance at 260 nm. Then DNA 
polymerase was used to reverse transcribe and amplify 
25 /zg of total RNA in a single tube assay using the 

10 Perkin-Elmer TaqMan EZ RT-PCR kit (Perkin-Elmer Applied 
Biosystems, Foster City, CA) with gene-specific sense 
and anti^sense primers and a probe f luorescently 
labeled at the 5' end with 6 -carboxyl -fluorescein (6- 
FAM) {Kruse (1997) J Immunol. Methods 210 (2 ): 195-203 ; 

15 Heid et al . (1996) Genome Res 6 (10) : 986-94) . Primers 
and f luorescently labeled probes were generated using 
Primer Express software (Perkin-Elmer) , and were 
synthesized by Perkin-Elmer. To avoid amplification of 
contaminating genomic DNA, primer pairs crossing 

20 intron/exon boundaries were selected. Duplicate 

samples were, reverse transcribed for 30 min at 60°C and 
then subjected to 40 rounds of amplification for 15 sec 
at 95°C and 1 min at 60°C using the ABI Prism 7700 
sequence detection system as described by the 

25 manufacturer (Perkin-Elmer) (Kruse et al. (1997) supra) . 
Sequence-specific amplification was detected as an 
increased fluorescence signal of 6-FAM during the 
amplification cycle. Quantitation of gene- specif ic 
message levels was based on a comparison of the 

30 fluorescence intensity in the unknown mRNA sample to 
the fluorescence intensity from the standard curve of 
known mRNA levels. Amplification of the gene for human 
acidic ribosomal protein (HARP) was performed on all 
samples tested to control for variations in RNA 

35 amounts. MRNA of the above-mentioned genes were 
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normalized to this control mRNA. The results are shown 
in Figure 1, and demonstrate that the average 
expression of the above-mentioned genes were increased 
in MS tissues relative non-involved tissues, as was 
5 also confirmed from the GeneChip analysis. 

E. MS CLASS PREDICTOR 

[0256] A panel of differentially regulated genes were 
10 selected and compiled for use in a class predictor 
model, by modifying the model described in Golub et 
al., "Molecular Classification of Cancer: Class 
Discovery and Class Prediction by Gene Expression 
Monitoring" Science, 286: 531-537 (1999). The genes 
15 were selected based on a * neighborhood analysis" of how 
well they correlated to a class distinction, e.g. MS- 
afflicted or not. Each class is represented by an 
idealized expression pattern of genes, in which the 
expression level is uniformly high in one class (e.g. 
20 MS-af f licted) or uniformly low in the other (e.g. 

nondiseased) . The panel of markers selected for the 
class predictor is shown in Table 10, together with 
their Accession numbers. 

25 [0257] The prediction of whether a new sample is MS- 
af f licted or nondiseased is based on 'weighted votes' 
on the panel of selected genes. Each selected gene 
votes for either MS-afflicted or nondiseased, depending 
on whether its expression level X ± in the sample is 

3 0 closer to ^ MS or Mnon- disease - The magnitude of the vote is 
WiVj, where ^ is a weighing factor that reflects how 
well the gene is correlated with the class distinction 
and V, = 1X^(^3 + 

A*non-di eeased 

)/2| reflects the deviation 
of the expression level in the sample from the average 
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of ^ MS and Mnon- diseased- The sample is assigned a 
qualitative outcome . 

[0258] Validation of the class predictor was performed 
5 by using samples from subjects known to be MS-afflicted 
or nondiseased. As shown in Figure 2, 17 out of 18 
samples (-94%) were correctly predicted as either MS- 
afflicted or non-diseased, demonstrating that the class 
predictor may be used as a viable means for diagnosing 

10 a subject with multiple sclerosis. Additional data 

points on the selected panel of genes should only bring 
greater accuracy to the model's predictions. Moreover, 
it is possible that with further compilation of data 
from various stages or types of MS, the class predictor 

15 may be able to different the MS disease stage of a 
subject . 

Other variations and modifications of this invention 
will be obvious to those skilled in the art. This 
20 invention is not limited except as set forth in the 
claims . 
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Table 6 





A 


8 


c 




E 


1 


Table of Genes Known to be Associated with MS 


9 












'I 
O 


IL-12p40 










A 
** 


IL-10 










yJ 


ICAM-2 










O 


GAP-43 










-7 
I 


a-2 chimmaerin 








a 
O 


myetin-transcription factor- 1 






Q 


activin type II receptor | 






lU 


retinoic acid receptor ct-1 






1 1 


EF-1 | | 






1 O 

141 


neuron-specific enolase 






1 O 


proton ATPase 






1/1 


Glutaminyl-tRNA synthetase 






1^ 


Na, K-ATPase beta subunit 






1fi 


neuroendocrine-specific protein A 






1 7 
1 r 


alpha-N-acetylgalactosaminidase 






I o 


iiver type alkaline phosphatase 






1Q 


male-enhanced antigen 






9PI 


mitochondrial enoyl-CoA hydratase 




1 


import precursor-ATP synthase a 








serine dehydratase | 






23 


adenine nucleotide transIocator-2 






24 


ornithine decoboxylase antienzym 


e 




25 


protein kinase C type p-1 






26 


cell surface glycoprotein A15 






27 


IRF-2 | | 






28 


56-K autoantigen annexin XI 






29 


high endothelial venule 






30 


leukotriene A-4 hydrolase 






31 


transcription factor E2A 






32 


TNFa receptor-2 | 






33 


duffy chemokine receptor 






34 


MS-1 










35 


PRG-2 










36 


MBP 










37 


protein tyrosine phosphatase, receptor type, C 


38 


MOG l 








39 


Interferon pi 








40 


ICAM-1 










41 


CTLA4 










42 


APOE 
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Table 10 





A | B | C 


D 






Peripheral Blood Genes Predicitve of MuHi 


Die Sclerosis 




2 












3 


Gene 


Observed 


Random Perm 1% 


Random Perm 5% 


Randon Perm (Median) 


4 


FOS 


2.562527 


1.4817816 


1.1614933 


0.7007793 


5 


U19261 


2.114882 


1 .3043897 


0.87986296 


0.53156734 


6 


DUSP1 


1.939225 


1.2410891 


0.8299547 


0.5107011 


7 


KIAA0906 


1.87203 


1.2133218 


0.80576986 


0.48326102 


8 


MAP4K1 


1.860437 


1.2085855 


0.79408497 


0.47091565 


9 


DGKZ 


1.837408 


1 .2069645 


0.77625513 


0.4690797 


10 


SNRP70 


1.81934 


1.1756094 


0.7738778 


0.45766842 


11 


ETR101 


1.801398 


1.1731951 


0.76643366 


0.440602 


12 


KIAA0864 


1.775063 


1.1709846 


0.7597211 


0.43995833 


13 


SFRS8 


1.77253 


1.1582581 


0.7580478 


0.43687758 


14 


AI890903 


1.764722 


1.1560028 


0.7550534 


0.4319513 


15 


JUNB 


1.752881 


1.1488212 


0.74930257 


0.4269654 


16 


STAT4 


1.719926 


1.1444192 


0.7461597 


0.41349095 


17 


AGP3 


1.705897 


1.130565 


0.7446831 


0.4113997 


18 


HSU79253 


1.664187 


1.1186187 


0.7290762 


0.4062087 


19 


BIN1 


1.649725 


1.1111517 


0.7287055 


0.402228 


20 


ATM 


1.641806 


^ 1.1030456 


0.7274922 


0.39897352 


21 


EDG4 


1.634613 


1.0987875 


0.7267664 


0.39316127 


22 












23 


Prediction based on correlation metrics proposed by Gollub et. al. (1999) 
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WHAT IS CLAIMED: 

1. A method of diagnosing a subject with multiple 
sclerosis, the method comprising the step of comparing: 

a) a level of expression of a marker in a sample 
from the subject, wherein the marker is selected from 
the group consisting of markers listed in Tables 7-10, 
and 

b) a normal level of expression of the marker in a 
control sample, 

wherein a substantial difference between the level 
of expression of the marker in the sample from the 
subject and the normal level is an indication that the 
subject is afflicted with multiple sclerosis. 

2. The method of claim 1, wherein the marker 
corresponds to a transcribed polynucleotide or a 
portion thereof. 

3. The method of claim 2, wherein the sample is 
collected from brain tissue. 

4. The method of claim 2, wherein the sample is 
peripheral blood mononuclear cells. 

5. The method of claim 1, wherein the control sample 
is from a non-diseased subject and the substantial 
difference is a factor of at least about 2. 

6. The method of claim 1, wherein the control sample 
is from non-involved tissue of the subject and the 
substantial difference is a factor of at least about 2. 
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7. The method of claim 1, wherein the control sample 
is from non- involved tissue of the subject and the 
substantial difference is a factor of at least about 5. 

8. The method of claim 1, wherein the level of 
expression of the marker in the sample is assessed by- 
detecting the presence in the sample of a protein 
corresponding to the marker. 

9. The method of claim 8, wherein the presence of the 
protein is detected using a reagent which specifically 
binds with the protein. 

10. The method of claim 9, wherein the reagent 
comprises an antibody or fragments thereof. 

11. The method of claim 1, wherein the marker is 
selected from the markers listed in Table 9. 

12. The method of claim l, wherein the marker is 
selected from the markers listed in Table. 10. 

13. The method of claim 1, wherein the level of 
expression of the marker in the sample is assessed by 
detecting the presence in the sample of a transcribed 
polynucleotide or portion thereof, wherein the 
transcribed polynucleotide comprises the marker. 

14. The method of claim 13, wherein the transcribed 
polynucleotide is a mRNA. 

15. The method of claim 13, wherein the transcribed 
polynucleotide is a cDNA. 
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16. The method of claim 1, wherein the level of 
expression of the marker in the sample is assessed by 
detecting the presence in the sample of a transcribed 
polynucleotide or a portion thereof which hybridizes 
with a labeled probe under stringent conditions, 
wherein the transcribed polynucleotide comprises the 
marker. 

17. A method of diagnosing a subject with multiple 
sclerosis, the method comprising the step of comparing: 

a) a level of expression in a sample of the 
subject of each of a panel of markers independently 
selected from the markers listed in Tables 7-10, and 

b) a normal level of expression of the panel of 
markers obtained from a control sample, 

wherein the level of expression of the panel of 
markers is substantially different, relative to the 
corresponding normal level of expression of the panel 
of markers, indicates that the subject is afflicted 
with multiple sclerosis. 

18. The method of claim 17, wherein the panel of 
markers comprises at least 5 markers. 

19. The method of claim 17, wherein the control sample 
is from a non-diseased subject. 

20. The method of claim 17, wherein the control sample 
is from non-involved tissue of the subject. 

21. The method of claim 17, wherein the panel of 
markers comprises markers listed in Table 10. 
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22. A method for monitoring the progression of 
multiple sclerosis in a subject, the method comprising 
the steps of: 

a) detecting in a subject sample at a first point 
in time, a level of expression of at least one marker, 
wherein the at least one marker is selected from the 
group consisting of markers listed in Tables 7-10; 

b) repeating step a) at a subsequent point in time 
with the at least one marker; 

c) detecting a substantial difference between the 
levels of expression detected in steps a) and b) ; 

wherein the substantial difference between the 
levels of expression indicates that the subject has 
progressed to a different stage of multiple sclerosis. 

23. The method of claim 22, wherein at least 5 markers 
are selected from the group of markers listed in Tables 
7-10. 

24. The method of claim 22, wherein the at least one 
marker corresponds to a transcribed polynucleotide or 
portion thereof. 

25. The method of claim 24, wherein the samples are 
collected from brain tissue. 

26. The method of claim 24, wherein the samples are 
peripheral blood mononuclear cells. 

27. A method of assessing the efficacy of a test 
compound for inhibiting multiple sclerosis in a 
subject, the method comprising the step of comparing: 

a) expression of a marker in a first sample 
obtained from the subject and exposed to the test 
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compound, wherein the marker is selected from the group 
consisting of markers listed in Tables 7-10, and 

b) expression of the same marker in a second 
sample obtained from the subject, wherein the second 
sample is not exposed to the test compound, 

wherein a substantially different level of 
expression of the marker in the first sample, relative 
to the second sample, is an indication that the test 
compound is efficacious for inhibiting multiple 
sclerosis in the subject. 

28. The method of claim 27, wherein the first and 
second samples are portions of a single sample obtained 
from the subject. 

29. The method of claim 28, wherein the level of 
expression in the first sample approximates the level 
of expression in a control sample. 

30. A method of assessing the efficacy of a therapy 
for inhibiting multiple sclerosis in a subject, the 
method comprising the steps of comparing: 

a) expression of a marker in a first sample 
obtained from the subject prior to providing at least a 
portion of the therapy to the subject, wherein the 
marker is selected from the group consisting of markers 
listed in Tables 7-10, and 

b) expression of the marker in a second sample 
following provision of the portion of the therapy, 

wherein a substantially different level of 
expression of the marker in the second sample, relative 
to the first sample, is an indication that the test 
compound is efficacious for inhibiting multiple 
sclerosis in the subject. 
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31. The method of claim 30, wherein a substantially 
similar level of expression in the second sample, 
relative to the control sample, is an additional 
indication that the test compound is efficacious for 
inhibiting multiple sclerosis. 

32. The method of claim 31, further comparing 
expression of the marker in a control sample, wherein 
expression of the marker in the second sample is 
similar to expression of the marker in the control 
sample . 

33. A method of screening test compounds for 
inhibitors of multiple sclerosis, the method comprising 
the steps of : 

a) obtaining a sample comprising cells from the 
subject; 

b) separately maintaining aliquots of the sample 
in the presence of a plurality of test compounds; 

c) comparing the expression levels of a marker in 
each of the aliquots, wherein the marker is selected 
from the group consisting of markers listed in Tables 
7-10; and 

d) selecting one of the test compounds which 
induces a substantially different level of expression 
of the marker in the aliquot containing that test 
compound, relative to other test compounds. 

34. The method of claim 33, wherein the substantially 
different level of expression is a substantially lower 
level of expression. 

35. The method of claim 34, wherein the substantially 
different level of expression is a substantially 
enhanced level of expression. 
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36. A kit for diagnosing a subject with multiple 
sclerosis, the kit comprising reagents for assessing 
expression of a marker selected from the group 
consisting of markers listed in Tables 7-10. 

37. A kit for diagnosing multiple sclerosis in a 
subject, the kit comprising a nucleic acid probe 
wherein the probe specifically binds with a transcribed 
polynucleotide corresponding to a marker selected from 
the group consisting of markers listed in Tables 7-10. 

38. A kit for assessing the suitability of each of a 
plurality of compounds for inhibiting progression of 
multiple sclerosis in a subject, the kit comprising: 

a) the plurality, of compounds; and 

b) a reagent for assessing expression of a marker 
selected from the group consisting of markers listed in 
Tables 7-10. 

39. A kit for diagnosing multiple sclerosis in a 
subject, the kit comprising an antibody, wherein the 
antibody specifically binds with a protein 
corresponding to a marker selected from the group 
consisting of markers listed in Tables 7-10. 

40. A method of modulating a level of expression of a 
marker selected from the markers listed in Tables 7-10, 
the method comprising providing to diseased cells of a 
subject an antisense oligonucleotide complementary to a 
polynucleotide corresponding to the marker. 

41. A method of modulating a level of expression of a 
marker selected from the markers listed in Tables 7-10, 
the method comprising providing to diseased cells of a 
subject a protein. 
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42. The method of claim 41, wherein the protein is 
provided to the cells by providing a vector comprising 
a polynucleotide encoding the protein to the cells. 

43. A method of modulating a level of expression of a 
marker selected from the markers listed in Tables 7-10, 
the method comprising providing to diseased cells of a 
subject an antibody. 

44. The method according to claim 43, wherein the 
method further comprises a therapeutic moiety 
conjugated to the antibody. 

45. A method of localizing a therapeutic moiety to 
diseased tissue comprising exposing the tissue to an 
antibody which is specific to a protein encoded from a 
marker listed in Tables 7-10. 

46. A method of screening for a test compound capable 
of modulating the activity of a protein encoded from a 
marker listed in Tables 7-10, the method comprising 
combining the protein and the test compound, and 
determining the effect of the test compound on the 
therapeutic efficacy of the protein. 

47. A method of screening for a bioactive agent 
capable of interfering with the binding of a protein 
encoded from a marker listed in Tables 7-10 and an 
antibody which binds to the protein, the method 
comprising : 

a) combining the protein, a bioactive agent and an 
antibody which binds to the protein; and 

b) determining the binding of the protein or 
fragment thereof and the antibody. 
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48. An antibody which specifically binds to a protein 
encoded from a marker listed in Tables 7-10. 

49. The antibody of claim 48, wherein the antibody is 
a monoclonal antibody. 

50. The antibody of claim 49, wherein the antibody is 
a humanized antibody. 

51. A peptide encoded from markers listed in Tables 7- 
10. 

52. A composition comprising the peptide of claim 51. 

53 . A composition capable of modulating an immune 
response in a subject, the composition comprising a 
protein encoded from a marker listed in Tables 7-10, 
and a pharmaceutically acceptable carrier. 

54 . A biochip comprising a panel of markers selected 
from the group of markers listed in Tables 7-10. 

55. The biochip of claim 54, wherein the markers are 
selected for subjects suspected of having secondary 
progressive multiple sclerosis. 

56. The biochip of claim 54, wherein the markers are 
selected for subjects suspected of having primary 
progressive multiple sclerosis. 

57. The biochip of claim 54, wherein the markers are 
selected for subjects suspected of having relaxing- 
remitting multiple sclerosis. 



WO 02/079218 



PCT/US02/09305 



- 171 - 

58. The biochip of claim 54, wherein the markers are 
selected for subjects suspected of having multiple 
sclerosis, wherein the subjects are from a high-risk 
geographic region. 
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BOX D. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

Group 1-808, claim(s) 1-26, in part, drawn to methods of diagnosing a subject with multiple sclerosis by comparing expression of a 
marker listed in Tables 7-10 to normal expression levels or using a panel of markers for expression analysis of multiple sclerosis. It 
is noted that Groups 1-808 correspond to the 808 markers of Tables 7-10. Therefore, the first mentioned invention is the methods of 
claim 1 to the extent that they apply to expression analysis of XAP4 (Accession U67322) and panels of at least five markers wherein 
the pane] comprises XAP4. Group 1, the first mentioned invention, is the invention which will be searched in accordance with 
PCT Article 17(3) (a). Additional groups may be elected. For example, if Group 2 is elected, and the proper fees are paid, then 
claim 1 will be examined to the extent that they apply to methods of expression analysis with respect to OA48-18 (Accession Number 
AF069250) or a panel of five makers comprising said marker recited in claim 1 . Upon election of an invention to be searched in 
addition to group 1 , please identify the number of the gene to be searched in the method claims. 

Group 809-1616, claim(s) 27-35. in part, drawn to methods of assessing the efficacy of therapy or compounds by detecting expression 
of a marker listed in Tables 7-10. 

Group 1617-2424. claim(s) 36-38, 54-58 in part, drawn to kits comprising nucleic acids of Tables 7-10 and biochips. 

Group 2425-3232, claim(s) 36, 39, 48-50 in part, drawn to kits comprising antibodies which binds with a protein corresponding to the 
markers of Tables 7- 10. 

Group 3233-4040. claim(s) 40, drawn to methods of modulating expression using ami sense oligonucleotides . 
Group 4041-4848, dairies) 41-42. drawn to methods of modulating expression using a protein of Tables 7-10. 
Group 4849-5656, claim(s) 43-44, drawn to methods of modulating expression using an antibody of Tables 7-10. 
Group 5657-6464, claim(s) 45, drawn to methods of localizing a therapeutic moiety. 

Group 6465-7272, claim(s) 46-47, drawn to methods of screening for test compounds by monitoring expression of protein of Tables 
7-10. 

Group 7273-8080, claim(s) 51-53. drawn to peptides encoded from markers listed in Tables 7- 10. 

The inventions listed as Groups 1-89 do not relate to a single general inventive concept under PCT Rule 13.1 because, under PCT 
Rule 13.2, they lack the same or corresponding special technical features for the following reasons. 

The claims are drawn to detecting expression of numerous genes, in the alternative, as diagnostic of multiple sclerosis. It is well 
known in the art that various genes arc differentially expressed in multiple sclerosis cells such that one may detect a predisposition of 
MS based upon overexpression of a gene compared to the normal expression level of the gene. The disclosure specifically states that 
"A study by Whitney and colleagues utilized genetic expression data from one patient to compare MS tissue to non-MS tissue, 
revealing that about sixty- two genes were differentially expressed, including the Duffy chemokine receptor, interferon regulatory 
factor-2 and tumor necrosis factor alpha receptor 2." (page 3, lines 5- 15). Moreover, it is noted that the art teaches the association of 
over expression of many genes in MS (Baranzini, J. Immunology Vol 165, pages 6576-6582, 2000). Therefore, there is no special 
technical feature which links the claims as defined by PCT Rule 13.2, and therefore lack of unity is present between the groups. 

Additionally, methods which are comparing expression of one gene are not the same as method comparing expression of a different 
gene. The expression level of one gene as indicative of MS does not bear any indication of how a different gene will be expressed in 
cancer cells. The molecules, namely genes, which are relied upon in each of the methods do not share a common structure. 

With respect to the markers, namely the polynucleotides and polypeptides, each of the genes provided in Table 7-10 arc identified by 
Accession Number indicating that the sequence of the genes were known in the art. For example, XAP4, Genbank Accession Number 
U67322. available in 1999 contains both a nucleic acid and a polypcpetidc sequence. Therefore, the genes themselves are not 
contributions over the art as exemplified in the Tables themselves. 
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The groups comprising polynucleotides, kits, polypeptides, antibodies are additionally drawn to multiple, distinct products lacking the 
same or corresponding special technical features. The nucleic acids are composed of nucleotides and function in. e.g. , methods of 
nucleic acid hybridization or amplification. These groups are directed to different combinations of nucleic acids which are different 
from one another and may be employed in different methods. The polypeptides differ in both structure and function from either the 
nucleic acids or the transgenic organisms. The polypeptides are composed of amino acids linked by peptide bonds and arranged in a 
complex combination of alpha helices, beta pleated sheets, hydrophobic and hydrophilic domains. The polypeptides also differ in 
function, e.g., fusion proteins with an enzymatic functions. The antibodies arc composed of amino acids linked by peptide bonds, 
antibodies are glycosylated and their tertiary structure is unique, where four subunits (2 light chains and 2 heavy chains) associated 
via disulfide bonds into a Y-shaped symmetric dimer. The antibodies function in immunoassays. As products of different sets of 
Groups differ from each other in structure, function, and effect, they do not belong to a recognized class of chemical compound, or 
have both a "common property or activity" and a common structure as would be required to show that the inventions are "of a similar 
nature". 

The methods are similarly not the same because they rely upon different products and have different objectives, reagents and results. 
Therefore, the methods lack unity. 
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